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ABSTRACT 


Fixed  configuration  filter  theory  provides  a methodology  for  designing 
filters  of  reduced  complexity  which  will  provide  suboptimal  performance  in 
general,  and  optimal  performance  under  certain  conditions.  In  this  report 
we  derive  fixed  configuration  reduced  order  filters  for  continuous  and  dis- 
crete time  systems,  with  and  without  state  dependent  noise. 
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FIXED  CONFIGURATION  REDUCED  ORDER  FILTERS 

I.  INTRODUCTION 

Fixed  configuration  filter  theory  provides  a methodology  for  designing 
filters  of  reduced  complexity  which  will  provide  near  optimal  performance. 
Sometimes  the  performance  is  optimal,  although  the  filter  structure  is  sim- 
plified. The  simplification  is  achieved  by  specifying  the  filter  structure 
a priori,  and  then  optimizing  the  free  parameters  of  the  filter.  The  advan- 
tage of  such  a filter  over  an  optimal  linear  Kalman  filter  is  that  far  fewer 
on-line  calculations  are  necessary.  This  is  desirable  for  many  applications 
where  the  computational  facilities  are  limited  and/or  the  state  vector  is  so 
large  that  it  is  not  feasible  to  perform  all  the  on-line  calculations  required 
for  Kalman  filtering.  Such  situations  are  common  Air  Force  problems. 

The  fixed  configuration  filter  discussed  in  this  report  is  referred  to 
as  a reduced  order  filter.  It  has  applications  in  several  areas.  One  appar- 
ent area  of  usefulness  occurs  in  aided  navigation  systems  such  as  loran-iner- 
tial  or  doppler-inertial-loran  systems.  Another  area  of  application  is  point- 
ing and  tracking  problems.  The  criterion  for  applicability  of  reduced  order 
filtering  is  to  have  a relatively  high  order  system  with  interest  in  estimat- 
ing only  some  of  the  state  variables.  This  is  often  the  case  when  many  of 
the  state  variables  occur  due  to  a detailed  model  of  the  noise  processes  in 
the  system.  Here,  the  interest  is  not  to  estimate  the  noise  state  variables, 
but  only  their  effect  on  the  variables  of  concern,  such  as  position  and  veloc- 
ity. 
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Much  of  the  previous  work  in  fixed  configuration  filtering  has  suffered 
from  a rather  important  factor.  Although  the  on-line  computational  require- 
ments were  significantly  reduced,  the  off-line  computation  of  the  optimal 
filter  parameters  was  difficult.  Sometimes  the  situation  was  handled  by 
solving  a difficult  nonlinear  two-point  boundary-val ue  problem  (TPBVP)  [1]. 

In  other  cases,  filter  parameters  were  not  truly  optimized,  but  only  optimized 
with  regard  to  the  following  stage  of  estimation,  [2].  In  the  work  presented 
here  we  continue  to  investigate  reduced  order  filtering  problems  with  the 
same  general  context  set  forth  in  [3].  The  solutions  obtained  are  basically 
of  two  types: 

1.  Truly  optimal  solutions  where  the  TPBVP  reduced  to  a single-point 
boundary-val ue  problem. 

2.  Suboptimal  or  partially  optimized  solutions  where  only  a linear 
TPBVP  must  be  solved. 

Both  of  the  above  mentioned  solution  categories  have  the  property  that 
the  amount  of  off-line  computation  required  is  predictable  and  reasonable. 

This  fact  makes  the  methodology  we  are  suggesting  feasible  in  that  the  reduced 
order  filtering  problems  can  be  solved  with  a realistic  amount  of  off-line 
computation,  while  the  on-line  computational  savings  could  be  enormous. 

The  main  contributions  of  this  research  are  presented  in  the  first  four 
papers  appearing  in  the  Appendix.  The  first  two  papers  deal  with  continuous 
time  problems.  The  first  paper  is  entitled  "Reduced  Order  Modeling."  It  has 
been  submitted  to  the  IEEE  Transactions  on  Automatic  Control  as  a short  paper. 

The  second  paper  is  entitled  "Reduced  Order  Filtering  with  State  Dependent 
Noise."  It  has  beeen  submitted  to  the  1978  Joint  Automatic  Control  Confer- 
ence, and  will  also  be  reviewed  for  the  IEEE  Transactions  on  Automatic 
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The  third  and  fourth  papers  are  discrete  time  versions  of  the  reduced 
order  filtering  problems.  These  are  draft  copies,  not  presently  in  the  final 
form  for  submission  to  appropriate  journals.  The  titles  are  "Linear  Discrete 
Reduced  Order  Filtering,"  and  "Discrete  Reduced  Order  Filtering  with  State 
Dependent  Noise."  The  discrete  problems  are  distinctly  different  from  the 
continuous  problems,  and  represent  nontrivial  extensions.  Perhaps  this  fact 
is  one  of  the  more  important  findings  of  the  research.  The  fifth  paper  in 
Appendix  A is  not  pertinent  to  this  research,  but  was  performed  during  the 
duration  of  this  grant. 

The  details  of  the  research  are  to  be  found  in  the  papers  in  the  Appendix. 
A discussion  of  the  papers  and  an  overview  of  the  research  findings  are  pre- 
sented in  the  following  sections  of  this  report.  The  work  presented  herein  is 
a continuation  of  an  investigation  which  began  in  the  summer  of  1976  while  the 
principal  investigator  participated  in  the  USAF/ASEE  sponsored  summer  faculty 
research  program  at  the  Frank  J.  Seiler  Research  Laboratory. 

II.  DISCUSSION 

a)  Reduced  Order  Modeling 

The  paper  "Reduced  Order  Modeling"  presents  a method  for  modeling  a 
linear  stochastic  system  of  high  order,  using  a reduced  order  model.  The 
problem  is  formulated  as  follows.  Given  a linear  stochastic  model 

X (t)  = A (t)  X (t)  + w (t)  (1) 

with  output 

y (t)  = C (t)  X (t)  (2) 


where  w{t)  is  zero  mean  white  noise,  find  a reduced  order  model 
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y (t)  = F (t)  y (t)  + K (t)  w (t)  (3) 

which  approximates  (1)  and  (2).  The  error  criterion  is  quadratic 

J = E{/^e’'’(t)  R(t)  e(t)  dt  + e'^(tf)  Se(tf)l  (4) 

^0 

where  e is  the  equation  error 

e A y - y (5) 

This  problem  was  motivated  by  the  work  of  Obinata  and  Inooka  [4],  but  uses 
much  the  same  mathematics  as  that  in  [3].  It  is  really  a model  matching 
problem,  but  may  be  regarded  as  a peculiar  kind  of  reduced  order  filtering 
problem  where  one  only  measures  the  input  noise,  and  there  are  no  noisy 
observations  of  the  state  vector. 

The  optimal  results  for  this  paper  are  characterized  by  a singular  arc 
which  exists  when  the  matrices  F(t)  and  K(t),  and  the  vector  of  initial 
conditions,  y(tQ)  are  selected  appropriately.  The  correct  choice  of  the 
matrices  is  extremely  simple  when  it  can  be  done,  but  generally  the  algebra 
will  not  have  a solution.  In  this  case  a suboptimal  approach  is  suggested 
where  F(t)  is  selected  a priori  and  only  K(t)  and  y(tg)  are  optimized. 

The  solution  is  obtained  as  a single-point  boundary-value  problem. 

The  value  of  y (tg)  in  this  paper  was  not  constrained  a priori  so  that 
y(t)  would  be  an  unbiased  estimate  of  y (t)  for  t^t^.  It  was  obtained  via 
application  of  the  generalized  boundary  condition  [5]  from  the  calculus  of 
variations.  When  the  optimal  choice  for  F (t)  could  be  obtained,  this 
approach  led  to  the  same  result  as  an  unbiased  constraint.  We  obtained  the 
very  important  result  that  when  F (t)  is  not  selected  optimally,  it  is  better 
to  select  y (tg)  in  such  a way  that  y (t)  is  a biased  estimate  of  y (t). 
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That  is,  the  quadratic  performance  measure  will  be  smaller  if  this  is  done. 

At  this  point  it  appears  that  the  same  remark  will  hold  true  in  the  other 
filtering  problem  which  we  have  solved  with  the  unbiased  constraint  applied 
prior  to  optimization.  This  may  be  an  important  property  pertinent  to  the 
reduced  order  area,  however  we  have  not  gone  through  the  mathematics  to 
generalize  the  result  yet. 

b)  Reduced  Order  Filtering  with  State  Dependent  Noise 
This  paper  represents  an  extension  of  the  basic  theory  set  forth  in  [3] 
to  systems  with  state  dependent  noise  as  described  by  stochastic  equations  of 
the  form 

dx  (t)  = A (t)  X (t)  d t + dw  (t)  + X!  [x,-  (t)  - y.  (t)]  G.  (t)  d v(t) 

i=l  ' ' ' (6) 

where  the  disturbances  are  zero  mean  incremental  wiener  processes  and  y(t)  is 
the  mean  value  of  the  state  vector.  The  observation  vector  is  also  corrupted 
by  state  dependent  noise.  It  is  of  the  form 

dy  (t)  = d V (t)  + C (t)  x-(t)  d t + X"  [x-  (t)  - y^  (t)]  M,-  (t)  d v (7) 

i = l 

A filter  is  to  be  designed  to  estimate  a lower  order  vector 

z (t)  = N (t)  X (t)  (8) 

The  filter  is  of  the  form 

dz(t)  = [F(t)z(t)+g(t)jdt  + K(t)dy(t)  (9) 

and  the  estimate  of  z is  required  to  be  unbiased.  We  select  g(t)  and  z (t^) 
to  meet  the  unbiased  requirement,  and  then  optimize  the  choice  of  F(t)  and 
K(t)  with  respect  to  the  quadratic  performance  criterion 
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J = E {/  e'''(t)  Qe(t)  dt  + e'^(t^)  Se(t^)  } (10) 

t 

0 

The  solution  proceeds  along  the  same  lines  as  that  obtained  in  [3],  with 
different  equations  resulting  from  the  state  dependent  noise.  There  is  an 
interesting  interpretation  to  the  result  obtained  when 

A Z Pxx  (t)  = 0 (11) 

i = l 
j = l 

where  H is  the  covariance  matrix  associated  with  v.  The  solution  is  the  same 
as  that  obtained  in  [3],  but  with  R,  the  covariance  matrix  associated  with  v 
replaced  by  R + and  Q,  the  plant  noise  covariance  matrix  replaced  by  Q + 

where 


f.  A V P G.  H 
^ -i=l  ^ 


A V P M.  E mT  13 

3 -f-  XX.  . 1 j 

1 = 1 1 J 

j = l 

The  analogy  could  be  carried  further,  without  requiring  (11),  if  w and  v 
were  allowed  to  have  non  zero  cross  correlation.  There  are  some  interesting 
aspects  to  what  we  have  said  here.  As  an  example,  one  may  have  meaningful 
problems  with  state  dependent  noise  when  R = o.  Such  problems  would  not  be 
well  posed  without  state  dependent  noise.  It  is  remarkable,  considering  the 
complexity  of  problems  involving  state  dependent  noise,  that  the  results 
obtained  in  [3]  could  be  extended  to  this  class  of  problems  with  such  minor 
modi fi cat ions . 


c)  Linear  Discrete  Reduced  Order  Filtering 


i: 

In  the  proposal,  it  was  stated  that  the  research  area  would  be  pursued  ■ 

i 

to  some  extent  within  a discrete  framework.  This  was  motivated  by  the  fact 

that  the  discrete  format  would  make  it  easy  toevaluate  the  amount  of  on-line 

calculation  required  by  the  reduced  order  filters.  What  we  found  was  that 

the  character  of  the  discrete  problem  was  considerably  different  than  that 

of  the  continuous  time  problem,  and  that  the  extension  of  the  results  was  i 

not  a trivial  exercise.  The  main  reason  for  the  difference  is  that  matrices  : 

which  occur  only  linearly  in  the  continuous  problem  occur  quadratically  in  ■ 

the  discrete  problem. 

The  discrete  problem  is  formulated  using  the  dynamical  model  ^ 

X.  ^ = A X + w. ; j = 0,  1,  . . . (14)  I 

J ^ J J J j 

with  observation  model  i 


y.  , = C X + V.  ; j = 1,  2,  .. 
j + 1 J + 1 J + 1 J +1 


(15) 


where  w-  and  v.  are  zero  mean  white  noise  sequences.  A lower  order  linear 
J J 

transformation  of  the  state  vector 


z . AN.  X . 
J - J J 


is  to  be  estimated  by  the  linear  filter 


(16) 


z.  =F  z.+K.y.  +g. 

J + 1 j J J J + 1 J 


(17) 


The  deterministic  vector  sequence,  q ^ , and  the  filter  initial  condition,  z^, 

are  to  be  selected  so  that  z.  is  an  unbiased  estimate  of  z . . The  matrix 

J J 

sequences,  F.  and  K.  are  then  chosen  to  minimize  a quadratic  performance 
J J 

measure  in  the  error 
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e . A z . - z . 
J - J J 


(18) 


The  performance  measure  is 


T T 

'jii  'j  "j  "j  5'h' 


(19) 


A general  TPBVP  is  specified,  whose  solution  gives  the  required  sequence  of 

matrices,  F.  and  K..  The  interesting  feature  of  the  problem  that  differen- 
J J 

tiates  it  from  the  continuous  time  problem,  is  that  the  matrices  F.  and  K. 

J 

both  appear  quadratical ly  in  the  Hamiltonian.  The  singular  optimization 
problem  is  therefore  not  present  in  the  discrete  case.  This  has  both  good 
and  bad  aspects.  The  good  feature  is  that  one  does  not  have  to  go  through 
the  excessively  tedious  mathematics  of  deriving  conditions  for  a singular  arc. 
The  bad  part  is  that  it  is  not  obvious  how  one  should  proceed  to  solve  the 
TPBVP. 

The  way  that  the  TPBVP  simplifies  is  interesting,  and  closely  related 
to  the  unbiased  requirement.  To  satisfy  the  unbiased  requirement,  it  is 
necessary  to  select 


g . = G . M . 
J J J 


(20) 


where  g.  = E {x.  } and  where 
J J 


G.a(N._,,  - K.  C.^J  A.  - F. 
J-J+1  J J+1  J J 


(21) 


It  turns  out  that  if  one  can  make  G.  = o by  selecting  F appropriately,  then 

J j 

the  TPBVP  simplifies  and  becomes  a single-point  boundary-value  problem.  It 

is  not  always  possible  to  find  an  F.  that  makes  G.  equal  to  zero  however. 

J 'J 

For  this  reason  the  problem  is  considered  where  only  is  optimized  over  the 
entire  interval,  and  F^  is  selected  prior  to  optimization,  perhaps  according 
to  a one-stage  optimization  procedure  [2].  The  important  aspect  of  the 
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result  under  these  circumstances  is  that  one  only  has  to  solve  a linear  TPBVP. 
Such  problems  can  be  solved  using  invariant  imbedding  techniques  with  the 
method  leading  to  a discrete  Riccati  equation.  Some  effort  was  spent  on  a 
computer  program  using  this  approach  during  the  course  of  this  research. 

The  author  feels  that  the  discrete  problem  and  its  solution  represent  a sig- 
nificant aspect  of  the  research  performed  during  the  period  of  this 
grant. 

d)  Discrete  Reduced  Order  Filtering  with  State  Dependent  Noise 
In  view  of  what  was  done  in  the  case  of  continuous  time  systems,  a 
natural  extension  of  the  preceding  paper  is  to  consider  the  case  of  state 
dependent  noise.  The  dynamical  model  considered  is  of  the  form 

n 

x(j+l)  = A(j)  x(j)  + w(j)  + x^(j)  r.  (j)  ri(j)  (22) 

i = l 

where  w(j)  and  ri{j)  are  zero  mean  discrete  white  noise  vectors.  The  obser- 
vation model  is  of  the  form 

n _ 

y (j  + 1)  = C (j  + 1)  X (j  + 1)  + V (j  + 1)  + X]  X (j)  M.  (j)  n(j)  (23) 

i = l ^ 

where  v (j  + 1)  is  the  zero  mean  discrete  white  measurement  noise  that  is  addi- 
tive. In  (22)  and  (23),  the  terms  x.j(j)  are  defined  as  x^.  (j)  - Efx^(j)}. 

We  note  that  often  measurements  of  the  form 

n _ . 

y (j  + 1)  = C (j  + 1)  X (j  + 1)  + V (j  + 1)  + XI  X.  ( j + 1)  M.  (j  + 1)  e(j  + 1) 

i = l ^ 

(24) 

can  be  put  in  the  required  form  (23),  where  r represents  a disturbance  influ- 
encing the  observation  in  a multiplicative  way.  Hence  we  do  not  view  the 
form  of  (23)  as  overly  restrictive.  The  ability  to  treat  nonadditive  noise 
situations  is  a useful  addition  to  the  research  findings.  It  enables  one  to 
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treat  problems  where  the  measurement  is  either  present  or  absent  in  a random 
way,  as  is  exemplified  by  the  inertial  example  problem  considered  in  the 
paper. 

Just  as  in  the  continuous  time  problem,  the  results  carried  over  with 
only  minor  modification  to  the  case  with  state  dependent  noise,  i.e.  algorithms 
were  developed  for  designing  filters  of  the  form 

z(j  + l)  = F(j)  z(j)  + K(j)  y(j+l)  + g(j)  (25) 

where  z(j)  is  a reduced  order  estimate  for  z(j)  = N(3)  x(j).  These  algo- 
rithms differ  in  relatively  minor  ways  from  those  developed  without  state 
dependent  noise. 

All  of  the  papers  discussed  in  this  section  and  presented  in  the  appendix 
deal  with  different  aspects  of  reduced  order  filtering  and  signal  processing. 

In  all  cases  we  have  had  as  a goal,  the  idea  of  avoiding  unrealistic  off-line 
computation,  e.g.  high  order  matrix  valued  non-linear  TPBVP's.  It  is  the 
author's  opinion  that  the  results  are  practical  in  the  sense  that  the  off-line 
computation  is  feasible,  and  the  on-line  computational  savings  may  be  tremen- 
dously important  in  a system  where  estimates  must  be  available  in  a limited 
time  frame,  and  the  dynamical  system  is  of  high  dimension. 

III.  OTHER  TOPICS 

The  most  important  aspects  of  the  research,  in  the  author's  opinion,  have 
been  set  forth  in  the  publications  which  have  been  discussed.  There  are  some 
other  topics  that  we  feel  are  worth  mentioning  however,  and  these  are  discussed 
in  this  section. 

a)  Steady  State  Results 

Consider  the  steady  state  or  stationary  version  of  the  problem  considered 
in  [3].  We  assume  that  there  is  a model  such  as  described  by  (1)  but  with 
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constant  A matrix  and  stationary  noise  w.  The  observation  model  is 

y(t)  = Cx(t)  + v(t)  (26) 

where  v (t)  is  zero  mean  stationary  white  noise.  The  vector  to  be  estimated 
is 

z (t)  = N X (t)  (27) 

and  the  filter  is  a reduced  order  filter,  not  time  variable, 

z (t)  = F z (t)  + Ky  (t)  (28) 

The  optimal  matrices  F and  K are  to  be  found  so  that  the  performance  measure 

J = E { e^  (t)  Ue  (t)}  (29) 

is  minimized,  where  e^z  - z. 

The  problem  is  basically  a calculus  problem,  that  can  be  stated  as 
follows.  Find  the  matrices  F and  K to  minimize 

J = tr  { } (30) 

where 

A E { e (t)  e^(t)  } (31) 

satisfies 

o = BP  +FP  +P  F^+P  B^  + NQN^  + KRK^  (32) 

xe  ee  ee  ex 

In  (32)  B is  defined  as 


Ba(NA-FN-KC) 


(33) 


and  Q is  the  covariance  matrix  associated  with  w while  R is  the  covariance 
matrix  associated  with  v.  The  matrix  is  defined  as 


12 


P = = L { X e^  } ( 34 ) 

xe  ex 

and  satisfies  the  requirement 

where  P is  the  second  moment  matrix 

XX 

P = E { xx^  } (36) 

XX 

which  may  be  regarded  as  known,  and  satisfies  the  constraint 

”"''’xx*'’xx 

We  note  that  there  are  stability  requirements  which  must  be  met  before  these 
equations  are  applicable. 

Augmenting  the  cost  function  with  the  constraints  (32),  (35),  and  the 
transpose  of  (35),  we  have 

J*  = tr  { UPgg  + [BP^g  + FP^^  + PggPT  + P^^B^  + NQN^  + K R ] A 

+ [AP  + P + P + Qn"^]  + [P  a''’  + BP  + FP^^  (38) 

xe  XX  xe  ^ xe  ex  xx  ex  ' ' 

+ N Q ] a"^  } 

ex 


Taking  the  gradients  of  J*  with  respect  to  P and  P setting  them  equal  to 
^ ■ xe  ee 

zero  gives  the  equations  for  the  Lagrange  multipliers 


(39) 
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T T 

o = AA  +A  f + ba 
xe  xe 


(40) 


We  note  that  A = A . Setting  the  gradient  with  respect  to  K equal  to 
ex  xe 


zero  gives 


K = [P  + a'\  P ] C^R'^ 
ex  ee  ex  xx 


(41) 


Since  F appears  linearly  in  the  Hamiltonian,  it  is  a bit  troublesome.  Taking 
the  gradient  of  J*  with  respect  to  F and  setting  it  equal  to  zero  gives  the 
requirement  that 


0a(P  -NP  )A  +(P  -NP  )a^  = o 
- ' ee  xe  ee  ex  xx  xe 


(42) 


It  can  be  seen  that  e will  be  zero  if  A is  zero  and  P - NP  = o. 

xe  ee  ><e 

We  will  show  that  if  F can  be  found  so  that  B is  zero,  i.e. 


NA-FN-KC=o 


(43) 


and  the  required  F is  nonsingular,  then  0 will  equal  zero  when  K is  selected 
optimally.  When  B = o,  (40)  is  solved  by  A = o.  Subtracting  (35) premulti- 

X0 

plied  by  N from  (32)  gives 


(P  - NP  ) F^  + F P„„  - N A P^^  + K R k’’’  = 0 
' ee  xe  ee  xe 


(44) 


Substituting  from  (43)  into  (44)  v;e  obtain 

0 = (P  - NP  )F^  + F(P  -NP  ) + K[Rk’^-CP  ] 
' ee  xe  ee  xe  xe 


(45) 


The  last  term  is  zero  when  K is  optimal,  as  can  be  seen  from  (41)  when  A^^  = o. 
Hence  when  F is  nonsingular,  (45)  implies  that 


P = N P 
ee  xe 


(46) 


and  consequently,  o is  zero.  When  a solution  to  (43)  can  be  found  it  is  of 
the  form 
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F = [NA  - KC]N+'+  r [I  - N n''' ] 


(47) 


where  ['  is  arbitrary  and  f indicates  the  pseudo  inverse. 

We  have  presented  the  equations  which  must  be  solved  for  an  optimal  F; 
however,  it  should  be  noted  that  both  F and  A must  have  all  eigenvalues  with 
negative  real  parts  for  the  solution  to  be  applicable.  The  steady  state 
value  of  p,  the  expected  value  of  the  state  must  be  zero  therefore;  and  con- 
sequently, the  required  filter  is  of  the  form  indicated  by  (28),  with  no 
bias  removing  function,  g(t). 

It  is  clear  that  (43)  cannot  always  be  solved,  and  we,  therefore,  look 
at  the  problem  with  specific  values  of  F.  If  F is  picked  a priori,  then  we 
may  solve  the  algebraic  equations  (37)  and  (39)  for  and  and  hence 

these  are  regarded  as  known  quantities.  The  remainder  of  the  problem  only 
involves  linear  algebra.  We  illustrate  this  by  example.  Suppose  F,  U,  and 

are  matrices  of  the  form 
ee 


F = fl 
U = u I 


(48) 


A = A I 
ee  ee 

where  f,  u,  and  A are  scalars.  Then  (35)  and  (40),  after  substitutina 
ee 

from  (41)  can  be  evaluated  as 


1 

^12 

P 

xe 

II 

1 

! ^ 

1 o 

^22 

. “2 

(49) 


as 
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At  this  point,  the  gain  K can  be  evaluated  by  substituting  the  solution  to 
(49)  in  (41).  We  note  that  if  the  solution  is  to  be  valid,  then  all  of  the 
eigenvalues  of  the  matrix 


(51) 


must  have  negative  real  parts.  The  author  would  like  to  point  out  that  the 
matter  of  a computational  search  for  a value  of  F that  will  lead  to  good 
performance  when  used  in  conjunction  with  the  above  equations,  appears  to  be 
complicated  by  the  fact  that  the  matrix  (51)  must  be  checked  for  stability  at 
each  stage  to  insure  that  the  equations  are  meaningful. 

In  summary,  reaardinq  the  steady  state  problem,  one  must  be  aware  that 
a solution  to  the  reduced  order  problem  may  not  be  possible,  even  when  a 
solution  for  the  corresponding  finite  interval  problem  is  available. 
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b)  Computational  Savings 

In  this  section  a few  remarks  are  made  about  on-line  computational  sav- 
ing, because  this  is  the  factor  that  motivates  the  work.  A reduced  order 
filter  of  dimension  i of  the  form  indicated  by  (17),  with  an  observation  of 
dimension  m requires 

i (s,  + m)  multiplies 
and 

f,  (ii  + m - 1)  adds 

for  each  on-line  filter  update.  A Kalman  filter  of  dimension  n would  require 

n (n  + m)  multipl ies 
n (n  + m - 2)  adds 

for  each  on-line  filter  update,  assuming  that  all  the  necessary  filter  param- 
eter were  stored.  The  savings  in  calculations  are  significant  when  the  reduced 
order  filter  is  used;  for  example,  if  a filter  of  dimension  2 is  used  to 
estimate  2 state  variables  of  a 10th  order  system,  and  only  a scalar  observa- 
tion is  available,  then  only  5%  of  the  calculations  of  a Kalman  filter  are 
needed  per  update.  Of  course,  this  difference  would  be  even  more  impressive 
if  n were  greater  than  10.  Performance  of  a reduced  order  filter  may  be 
very  good  as  indicated  by  the  example  in  [3]  (JACC  version),  and  the  off  line 
computation,  though  it  may  be  extensive,  definitely  is  feasible. 

IV.  AREAS  OF  FUTURE  RESEARCH 

A number  of  interesting  subjects  have  come  forth  during  the  course  of 
this  research  and  have  not  been  properly  resolved  at  this  stage.  We  will 


r 
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mention  them  briefly  in  this  section.  One  of  the  basic  issues  that  we  have 
not  confronted  is  "how  does  one  find  an  optimal  solution  when  it  is  not  pos- 
sible to  find  an  F mat>^ix  that  maintains  a singular  arc?"  We  have  chosen  to 
look  at  suboptimal  solutions  in  this  case,  rather  than  dealing  with  this 
central  issue.  Another  point  of  interest  is  with  regard  to  the  control  of 
stochastic  systems.  It  is  of  interest  to  know  how  well  a controller  would 
perform  if  the  control  were  required  to  be  the  output  of  a reduced  order 
filter,  designed  according  to  the  procedures  we  have  presented  here,  as 
opposed  to  those  in  [6]. 

The  reduced  order  modeling  problem  introduced  a novel  aspect  to  the  work, 
i.e.  can  we  do  better  than  we  have  been  doing  with  respect  to  minimizing  a 
quadratic  error  criterion,  if  we  drop  the  requirement  that  the  estimate  be 
unbiased?  Based  on  the  reduced  order  modeling  paper  in  the  appendix,  it 
appears  that  the  answer  is  yes  when  F is  selected  a priori. 

We  have  seen  that  the  steady  state  soljtion  to  the  problems  we  have  con- 
sidered does  not  exist  when  A is  an  unstable  matrix,  since  this  would  imply 
that  was  unbounded,  and  (37)  would  not  apply  of  course.  Another  area 

that  we  must  look  into  is  the  conditions  for  existence  of  the  solutions  of  the 
linear  two-point  boundary  value  problems  that  we  have  derived  in  this  research. 

V.  SUMMARY  AND  CONCLUSIONS 

Significant  progress  has  been  made  during  the  course  of  this  research  in 
several  areas  which  we  had  sought  to  investigate  at  the  onset  of  this  project. 
We  have  looked  at  both  continuous  and  discrete  problems,  with  both  additive 
and  state  dependent  noise  in  each  case.  It  has  become  clear  that  the  discrete 
problems  are  quite  different  than  the  continuous  time  problems,  primarily 
because  of  the  quadratic  occurrence  of  the  F matrix  in  the  discrete  time 
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problems,  while  the  corresponding  matrix  occurs  only  linearly  in  the  contin- 
uous time  problem,  giving  rise  to  a singular  solution.  It  would  be  interest- 
ing to  take  the  limit  of  the  discrete  solution  as  the  sampling  interval  goes 
to  zero  to  obtain  the  continuous  time  solution  [7],  to  clarify  the  relation- 
ship between  the  two  kinds  of  problems. 

The  author  feels  that  the  "specified  F"  solution  obtained  in  this  research 
are  of  considerable  practical  importance  in  the  design  of  reduced  order  fil- 
ters. The  reason  that  they  are  important  is  that  they  maintain  the  TPBVP 
nature  of  the  reduced  order  filtering  problem,  so  that  optimization  looks 
beyond  one  stage  ahead,  however  the  TPBVP  is  linear.  It  is  linear  even  in 
the  case  of  having  state  dependent  noise!  The  solution,  when  it  exists,  can 
therefore  be  obtained  off  line  in  a predictable  number  of  steps.  This  is 
unlike  the  case  of  nonlinear  TPBVP's  which  are  generally  solved  by  quasi- 
linearization or  some  such  method,  and  which  may  or  may  not  converge,  and  may 
be  very  sensitive  to  some  initial  guess  at  a solution.  Since  the  TPBVP's  we 
are  dealing  with  are  generally  of  high  order,  it  is  fortunate  indeed  that 
they  are  linear.  A logical  procedure  for  solving  reduced  order  filtering 
problems  which  we  are  currently  working  on  is  to  obtain  an  F matrix  that  is 
optimal  with  respect  to  the  next  stage,  as  in  [2],  then  modify  the  results 
by  solving  the  linear  TPBVP  to  get  the  corresponding  optimal  gain. 

As  we  have  acknowledged  in  the  previous  section,  there  are  still  a number 
of  unanswered  questions  in  this  area  of  reduced  order  filtering.  The  author 
believes,  however,  that  the  methods  which  have  been  investigated  here  are  very 
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promising. 
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quadratic  penalty  function  of  the  equation  error. 
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1.  INTRODUCTION 

It  is  often  convenient  to  model  a high  order  system  using  a low  order 
dynamical  model.  Several  researchers  have  considered  this  type  of  problem  [1]  - 
[8].  In  a recent  paper,  Obinata  and  Inooka  [8]  have  treated  the  problem  of 
obtaining  a reduced  order  time  invariant  model  which  minimized  a steady  state 
quadratic  error  criterion.  In  this  paper  a similar  viewpoint  is  taken,  however 
we  solve  the  nonstationary  problem  with  a white  noise  input,  and  an  integral 
quadratic  performance  measure.  Optimal  control  theory  is  used  to  obtain  a 
solution,  which  is  conveniently  found  using  the  matrix  minimum  principle  [9]. 

The  problem  is  similar  to  the  filtering  problems  posed  by  Lee  [lOJ,  and  Sims 
and  Asher  [11],  in  that  a singular  solution  is  obtained.  When  a singular 
solution  is  not  possible  or  is  considered  too  complex,  a suboptimal  solution  is 
suggested . 

2.  PROBLEM  STATEMENT 

We  consider  the  linear  stochastic  model 

X (t)  = A (t)x  (t)  + w (t)  (1) 

where  w (t)  is  zero  mean  white  noise  with  covariance  matrix 

E{w(t)  wT(T)i  = Q(t)6  (t-x)  (2) 


The  state  vector  x(t)  is  of  dimension  n,  which  is  presumably  too  large  to  be 
desirable.  The  output  of  the  system,  y(t),  is  a linear  transformation  of  the 
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state  vector, 

y (t)  = C (t)  X (t)  (3) 

and  is  of  dimension  m which  is  much  smaller  than  n.  The  objective  is  to  find 
a reduced  order  model 

y (t)  = F (t)  y (t)  + K (t)  w (t)  (4) 

where  y(t)  is  an  m vector  which  adequately  approximates  y(t).  We  assume 
that  the  initial  statistics  of  the  state  vector  are  known 

E {x(t„)  } = Po 

° (5) 

Var  {X  (tg)  } = Pq 

The  problem  is  to  select  the  matrices  F(t)  and  K(t),  and  the  initial  condi- 
tion y (to)  in  a way  that  minimizes  the  performance  measure 


where  e(t),  the  equation  error  is  defined  as 

e (t)  Ay  (t)  - y (t)  (7) 

Note  that  we  do  not  require  apriori  that  y be  an  unbiased  estimate  of  y, 
although  this  turns  out  to  be  a property  of  the  optimal  solution.  If  one 
does  not  select  F optimally,  however,  it  can  be  advantageous  to  have  a nonzero 
expectation  of  the  error. 
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3.  OPTIMIZATION 

The  first  step  in  the  optimization  procedure  is  to  transform  the  problem 
into  an  equivalent  deterministic  problem.  From  equations  (1),  (3),  and  (4), 
the  error  dynamics  are  qoverned  by  the  stochastic  equation 


0 = Bx  + Fe  + (C-K)  w 


(8) 


where  B is  defined  as 


BAG  f CA  - FC 


The  second  moment  matrices  associated  with  (1)  and  (8)  are  defined  as 


(9) 


(t)  A E {X  (t)  x'  (t)  1 


XX 


P^g(t)A  E (x(t)  e'^(t)}  = P^,(t) 


ex 


Pee  ^ ^ 


(10) 


From  (1)  and  (8),  we  can  see  that  these  matrices  satisfy  the  deterministic 
equations 


P =AP  +P  A^  + Q 

XX  XX  XX  ^ 


xe 


AP_  + P_  + P^g  pT  + 0 (C-  K)T 


xe  XX 


(11) 


Pee  ■ e f-xe  * Pex  * F P,,  * P,,  MC  - K)  Q (C  - K)^ 


We  make  the  assumption  that  yltg)  is  to  be  deterministic.  Then  the  initial 
conditions  for  (11)  are 
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^XX  ^ *'0^  ' ^0  ^ ‘‘o  ‘^0 

*^xe^^o^  " ^^0  ^ '^0  ‘'o  ^ ^^0^  " ‘‘o  y ^^0^ 

Pee  (to)  = C (tj  (P^  + )c'  (t^)-  y(ty^  (t„)  - C (t„)  y^  (tj 

+ y (to)  y"^  (to) 


(12) 


The  performance  measure  (6)  may  be  written  in  terms  of  the  matrix  as 
tf 


J = tr  { I R(t)  Pee(t)  dt  + SPgg(tf)  } 

tn 


(13) 


The  problem  can  now  be  stated  completely  within  a deterministic  framework.  The 
matrices,  F(t)  and  K(t),  and  the  vector  y (t^)  are  to  be  selected  to  minimize 
(13)  subject  to  the  constraints  imposed  by  (11)  and  (12).  The  problem  is 
ideally  suited  to  solution  using  the  matrix  minimum  principle  [9]. 

The  Hamiltonian  is  formed  as 


T 


T 


H tr  {RPge  + Pgg  ^ ee  ^xe  ^xe  Pex  ^ex^ 


(14) 


where  we  have  ignored  the  equation  for  P since  it  is  a known  quantity.  The 

XX 

costate  equations  are  obtained  from  (14). 

-3H 


"xe  = ,p“  = "xe  - »xe  f ■ ‘ee 
xe 


(15) 


and 


eo  3p 


ee 


-R  - F'  A - A F 
ee  ee 


(16) 


The  terminal  conditions  for  (15)  and  (16)  are 


*xe  '‘f>  ' “ 


*ee(tf)  " S 


(17) 
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It  should  be  observed  that  is  simply  the  transpose  of 

The  transversal i ty  condition  applied  at  the  initial  time  provides  the 
correct  initial  condition  for  y.  Th  transversal i ty  condition  is 

tr  A^^(t„))  . 0 

(18) 

Since  and  Uq  are  fixed  and  the  only  allowable  variation  is  in  yCtp).  we 
have  from  (12), 

ti"  l[Agg  (tg)  (v  (tp)  - C (t^)  Up)  - A^^  (tg)  Mp  ] dy"*^  (tp) 

+ d;{tp)[A^g(to)  - C(to)  Up)  - Aex(to)  >^0^^  ^ = 0 

The  above  will  be  satisfied  for  arbitrary  variations  in  y(tp)  only  if  the 
equation 

*ee"o)  C(t„)  „„  + A„(t„)  (20) 

is  satisfied.  If  (tp)  is  nonsingular  then  y(tQ)  is  given  by 

y(t„)  - [C(to)  -fA^'d^)  A,,(t„)  ] (21) 

Next  we  consider  optimization  with  respect  to  the  gain  matrix,  K(t). 

Taking  the  gradient  of  the  Hamiltonian  with  respect  to  K and  setting  it  equal 
to  zero  gives  the  expression 

A (t)  K(t)  Q(t)  = A^  (t)  Q(t)  + A^  (t)  C(t)  Q(t)  (22) 

ee  ee 

Equation  (22)  will  have  a solution  K(t)  if  the  equation, 

Pee  «ee  t 'ex  ' »ee  1 <>  = ['ex  * 'ee  ^ 
is  satisfied  at  time  t.  If  a solution  exists,  it  is  of  the  form 


K = [h  + A C ] Q + r - a'^  A r Q Q 
ee  ex  ee  ee  ee 


(24) 


where  r is  an  arbitrarv  matrix.  In  the  above  expressions  t is  used  to  indicate 
the  pseudo  inverse 
solution  given  by 


the  pseudo  inverseof a matrix  [12].  When  A^^  is  nonsingular,  (22)  always  has  a 


K = [ C + A Ag^  ] Q + r [ I - Q ] 


(25) 


In  reality,  Q will  seldom  be  nonsingular,  however  if  it  is,  (25)  simplifies  to 

K (t)  = C (t)  + a"^  (t)  A (t)  (26) 

ee  ex 

and  a unique  expression  is  obtained. 

Obtaining  the  matrix  F (t)  is  considerably  more  involved  because  it  appears 
linearly  in  the  Hamiltonian.  It  therefore  leads  to  a singular  type  of  optimi- 
zation problem.  The  part  of  the  Hamiltonian  which  depends  on  F can  be  written 
as 


H*  = F 0 + o"’”  F^ 


(27) 


where  o is  defined  as 


G)  A (P 


ex 


C P ) 

XX 


A + (P 
xe  ee 


C P ) 
xe 


A 

ee 


(28) 


From  (20),  it  is  clear  that  o(t^)  is  zero.  If  we  can  show  that  Q(t)  is  zero 
during  the  interval  [t^  , t^J,  than  a singular  arc  exists.  Taking  the  time 
derivative  of  Ogives 


0 = F 0 - 0 F - KQ  [ A^^  + (c-  K)^  Agg  ] + (CP^^  ' ^ee  ^ 


The  third  term  in  Equation  (29)  is  always  zero  when  K is  selected  optimally, 
as  can  be  seen  from  Equation  (22).  Therefore,  if  R is  zero,  a singular  arc 
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exists  independent  of  the  choice  of  F.  Generally,  however,  R will  be  positive 
definite,  and  to  insure  a singular  arc  we  must  have  the  term  nACP  - P 1 

be  zero  in  addition  to  the  requirement  of  an  optimal  selection  of  K.  We  can 
have  !i^(t)  = 0 in  the  interval  of  interest  if  S2(tQ)  is  zero  and  fi(t)  is  zero 
in  the  interval  [t^  , t^  ].  The  time  derivative  of  can  be  shown  to  be 

= n + F u + (P  -CP  ) - KQ  (C  - K)''^  (30) 

0X  XX 

If  B is  zero,  then  from  (15)  and  (17)  it  can  be  seen  that  A is  zero.  An 

Xe 

optimal  choice  of  K thus  insures  that  the  last  term  in  (30)  is  zero,  and  (30) 
is  really  a homogeneous  equation  in  S2  when  B is  zero.  From  (20),  and  the  fact 
that  A (tn)  is  zero,  we  have  that  u{t)  is  zero  and  consequently  a singular 

Xe  o 0 

arc  will  exist  if  K is  selected  optimally  and  B is  required  to  be  zero.  This 
last  requirement  then  is  that 

B = C + CA  - FC  =0  (31) 

A solution  to  (31)  exists  if  and  only  if 

(C  + CA)  C^C  = C + CA  (32) 

If  a solution  exists,  it  is  of  the  form 

F = (C  + C A)  C ^ + r*  ( I - C C^)  (33) 

★ 

where  r is  an  arbitrary  matrix. 

We  have  thus  obtained  a solution  to  the  problem  when  it  exists.  The  matrix 

F is  to  be  selected  according  to  (33),  and  assuming  A is  nonsingular,  from 

ee 


(25)  we  have 
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f — — 

where  r is  arbitrary.  From  (21),  the  appropriate  initial  condition  is 

»(‘o>  • C(‘„) 

From  (3),  (5),  and  (35),  it  is  clear  that 

E { e (t^)  } = 0 (36) 

when  the  solution  is  optimal.  Furthermore,  from  (8)  it  is  clear  that  B = o 
implies  that  E {e(t)}=  o for  Note  that  although  we  did  not  require  a 

priori  that  y be  an  unbiased  estimate  of  y,  that  is  the  way  the  solution  turned 
out. 

Further  remarks  regarding  the  optimal  solution  are  appropriate.  Note  that 

the  weighting  matrices  R,  and  S,  and  the  terminal  time,  t^,  do  not  influence 

the  solution  (except  that  R and  S should  be  positive  definite  symmetric 

matrices.)  More  importantly  from  (33)  and  (34),  we  do  not  have  to  solve  for 

any  matrices  P , A , etc.  The  two-point  boundary-value  problem  (TPBVP)  has 
xe  xe 

in  effect  disappeared.  Not  even  the  initial  variance  P^  influences  the 
solution,  which  has  turned  out  to  be  extremely  simple. 

It  should  have  occurred  to  the  reader  that  (31)  cannot  always  be  satisfied. 

If  one  therefore  selects  F apriori,  without  regard  to  optimization.  Equations 
(15),  (16),  (17),  (21),  and  (24)  can  be  used  to  specify  the  optimum  choice  ofy(tp) 
and  K for  any  given  F.  The  solution,  though  suboptimal  in  an  overall  sense,  may 
be  adequate.  Note  that  in  this  case  the  weighting  matrices  R and  S do  influence 
the  solution.  It  should  also  be  observed  that  there  is  no  TPBVP  to  be  solved 
in  this  case  either,  although  a single-point  boundary-val ue  problem  must  be 
solved  off  line  to  evaluate  the  optimal  gain. 
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4.  EXAMPLES 


In  the  first  example,  we  illustrate  a case  where  a solution  exists,  and 
is  easily  obtained.  The  system  is  as  described  by  (1)  and  (3)  with 


; C (t)  = [cos  t sin  t ] 


and  Q nonsingular.  The  reduced  order  solution  obtained  from  (33),  (34),  and 
(35)  is  characterized  simply  and  uniquely  by 

F(t)  = 0 

K (t)  = [cos  t sin  t] 


y (trv)  = [cos  t sin  t„]  y 
“^  0'  0 0-'  0 


If  Q is  singular  however,  the  result  is  not  unique.  For  example  if 


0 0 


0 q 


then  instead  of  the  expression  (38)  for  K,  we  have 


K (t)  = [ Y (t)  sin  t ] 


where  Y(t)  is  arbitrary.  This  makes  sense  of  course  because  (40)  implies 
that  whatever  multiplies  w^  (t)  in  the  solution  makes  no  difference. 

We  shall  now  consider  a problem  where  (31)  cannot  be  satisfied.  A first 
order  model  of  the  form  indicated  by  (4)  is  to  approximate  a second  order 
model  of  the  form  indicated  by  (1)  and  (3)  with 


i 


i 
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A = 


a ^ 0 

0 01^ 


; C (t)  = [t  tj 


(42) 


with  Q>o  and  / a^.  Because  (31)  cannot  be  solved,  and  for  reasons  of 


simplicity,  F is  selected  to  be  zero.  The  performance  measure  to  be  minimized 
is 

T r.  o 


J = E { f e'^  (t)  d t + e'^  (T)  } 
0 


(43) 


The  optimum  values  for  the  gains  are  found  to  be 


T 


K.  (t)  = t + (1  + T - t)'^  e'“''^  f e^i^  u (t)  di  ; i = 1,  2 (44) 

1 t a^-- 


where 


u (T)A(l  + T)  + (a.T  + a-  l)T-a.T  ;i  = l,2  (45) 

- 1 1 1 


«i 


We  emphasize  that  F = o is  an  ad  hoc  decision  for  this  second  example,  while 
a unique  solution  for  the  first  case.  For  the  second  example  the  initial 
condition  is,  according  to  (21), 


y(o)  = (1  + T) 


•1 


T 


r y . u (t)  di 

^ 01  0 a. 

i=  1 ^ 


(46) 


SUMMARY 


In  this  paper  we  have  developed  new  methods  of  reduced  order  modeling  for 
time  variable  systems  with  white  noise  inputs.  The  criterion  used  for  purposes 
of  optimization  has  been  an  integral  quadratic  function  involving  the  equation 
error.  When  a singular  solution  to  the  reduced  order  modeling  problem  can  be 
obtained,  it  has  been  demonstrated  to  be  extremely  simple.  It  was  shown  that 
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the  optimal  solution  must  provide  an  unbiased  estimate.  When  the  singular 
optimal  solution  cannot  be  obtained,  a suboptimal  easily  implemented  procedure 
has  been  developed  for  obtaining  the  reduced  order  gain  matrix.  Both  the 
optimal  and  suboptimal  solutions  have  been  illustrated  by  example. 
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1.  INTRODUCTION 


There  has  been  considerable  work  dealing  with  the  topic  of  filter- 
ing for  problems  with  state  dependent  noise  [1-3].  As  well  as  being  of 
theoretical  interest,  the  topic  is  of  some  practical  importance  since 
many  systems  are  better  modeled  as  having  multiplicative  disturbances 
instead  of  additive.  One  example  occu“S  in  the  momentum  exchange  method 
for  regulating  the  angular  procession  of  a rotating  space  craft  [4]. 

There  is  a disturbance  which  depends  on  the  procession  rates.  Another 
example  occurs  in  the  design  of  phase  lock  loops  [2].  The  phase  insta- 
bility of  an  oscillator  described  in  rectangular  coordinates  appears  as 
white,  state  dependent  noise.  If  one  received  a signal  which  consisted 
of  a large  numb'‘r  of  sinusoids  of  various  frequencies,  each  having  phase 
distortion,  then  one  would  have  to  build  a high  order  filter  to  recover 
the  signal  using  existing  methods. 

The  design  of  high  order  filters  is  often  problematic  from  the  view- 
point of  on-line  computation.  Therefore,  a number  of  researchers  have 
been  interested  in  designing  filters  of  reduced  order  [5-8].  It  often 
happens  that  one  is  only  interested  in  estimating  a lower  order  linear 
transformation  of  a state  vector,  and  it  seems  reasonable  to  attempt  to 
do  this  with  a lower  order  filter.  Design  of  the  filter  parameters  is  a 
fixed  configuration  optimization  problem  [8-10].  In  such  problems,  the 
structure  is  not  necessarily  optimal,  but  given  the  structural  constraints, 
the  parameters  are  selected  optimally.  It  is  interesting  to  note  that 
these  problems  often  have  non-unique  solutions  because  there  are  too  many 
free  parameters.  This  feature  can  be  used  to  obtain  filters  which  are 
easier  to  implement  than  well-known  techniques  such  as  Kalman  filtering. 
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even  when  the  fixed  configuration  filter  is  of  full  order  [8].  In  some 
cases,  there  is  no  performance  loss  associated  with  the  alternative  linear 
filter,  [8],  [11]. 

In  this  paper  we  seek  to  extend  the  reduced  order  filtering  results 
developed  in  [8]  to  problems  with  state  dependent  noise.  The  problem 
is  similar  to  that  considered  in  [12],  however,  in  [12]  a discrete  system 
model  was  considered,  and  only  a single  stage/optimization  was  performed. 
Here  a continuous  time  problem  is  considered,  and  the  matrix  minimum 
principle  [13]  is  used  to  obtain  a solution.  Because  we  allow  a driving 
term  in  the  filter  to  remove  any  a-priori  bias,  it  turns  out  that  the 
problem  has  singular  arcs,  which  is  not  surprising  considering  previous 
works  [8],  [11]  in  the  area.  A very  nice  feature  of  the  work  is  that 
in  some  cases  only  linear  two-point  boundary  value  problems  are  obtained. 
These  can  be  solved  either  by  a direct  use  of  linear  systems  theory  or 
by  a Riccati  equation  technique.  Under  certain  circumstances  only  a 
single-point  boundary- val ue  problem  must  be  solved. 


II.  PROBLEM  STATEMENT 

The  system  of  interest  is  assumed  to  be  modeled  by  the  Ito  stochas- 
tic differential  equation 

dx(t)  = A(t)x(t)dt  + dw(t) 


n 

- L 

i = l 


Xi(t)-iii(t)  Gi-(t)dv(t) 


(1) 


where  x(t)  is  the  state  vector  of  deminsion  n and  u(t)  is  the  mean  value 

of  the  state  vector.  The  disturbances  are  zero  mean  incremental  Wiener 

processes  with  covariances 

Ejdw(t)dw''’(t)(  = Q(t)dt  (2) 

E|dv(t)dv^(t)!  = H(t)dt 
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It  is  not  hard  to  show  [l4]  that  the  mean  value  vector,  satisfies 

du(t)  = A(t)u(t)dt  (3) 

The  initial  condition  for  (1)  is  random  with  known  mean  and  variance 

£{At^)\=Uo  (4) 

Var  I x(t^)  } = (5) 


Equation  (4)  is  obviously  the  initial  condition  for  (3). 

The  observation  vector  is  also  corrupted  by  state  dependent  noise. 

n 

dy(t)  = C(t)  xit)dt+dv(t)  + 

i = l 

In  (6),  y(t)  is  the  observation  vector  of  dimension  m,  dv(t)  is  the 
additive  measurement  disturbance,  and  dv(t)  is  the  multiplicative  dis- 
durbance.  The  vector  v(t)  may  be  large,  and  some  of  its  elements  affect 
the  dynamic  model  through  the  terms  G.j , while  others  affect  the  obser- 
vational model  through  the  terms  M.j . The  additive  disturbance,  dv(t) 
is  a zero  mean  incremental  Wiener  process  with  covariance 

E { dv(t)dv^(t)  1 = R(t)dt  (7) 

The  terms  w(t),  v(t),  \{t)  and  x(to)  are  uncorrelated. 

Only  a linear  transformation  of  x(t)  is  to  be  estimated,  i.e.,  it 
is  desired  to  estimate 


x.j(t)-M.j(t)  M.j(t)dv  (6) 


z(t)  = N(t)x(t) 


(8) 


where  z(t)  is  a vector  of  dimension  )i4n. 

The  estimate  of  z(t),  which  we  call  z(t)  is  constrained  to  be  obtained 
by  the  filter  equation 

dz(t)  = |^F(t)z(t)  + g(t)j  dt  + K(t)dy  (9) 
The  vector  g(t)  and  the  initial  condition,  z(t^)  are  to  be  selected 
so  that 


E {e(t) f = 0 Vt  =|to,  t^ 


(10) 
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where  e(t)  is  the  error  vector 


e(t)  = z(t)  - z(t) 


(11) 


The  matrices  F(t)  and  K(t)  are  then  to  be  selected  so  that  a quadratic 
performance  measure  t^ 

(12) 


J = E{  / e^(t)  Qe(t)dt  + e^(tf)Se(tf ) ( 
t. 


is  minimized.  The  weighting  matrix  S is  assumed  to  be  positive  definite 
symmetric.  The  weighting  matrix  Q may  be  positive  definite  or  zero  and 
is  critically  important  to  the  solution. 

III.  GENERAL  SOLUTION 


In  order  to  proceed,  it  is  convenient  to  develop  an  equation  for 
the  error.  From  the  Ito  differential  rule  [^isj  , it  is  seen  that 

dz(t)  = N(t)dx(t)  + N(t)x(t)dt  (13) 

Using  (6),  (9),  and  (13)  it  is  seen  that  the  differential  equation  of 
the  error  is 

de(t)  = dz(t)  - dz(t) 


or 


de  = 


(NA-FN-KC+^)  X - g 


dt  + Ndw  - Kdv 
n 


+Fedt+  rNy;  x.G.  - kY,  x.M.l  dv 

Lit  ^ 1 


i = l 


i = l 


(14) 


In  (14)  we  have  introduced  the  notation,  x = x-ti.  From  (14)  it  is  seen 
that 


provided  that 


^Eje(t)[=  F(t)E)e(t)} 
dt 


g(t)  = (NA-FN-KC+N)!ilt) 
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(16) 


If  furthermore 


it  is  clear  that 


z(t  ) = N(t  )u(t  ) 
0 0 O 


E |e(t^)l=  0 


From  (15)  and  (18),  one  can  see  that  (10)  is  satisfied  so  that  (16) 

and  (17)  are  appropriate  selections.  If  g(t)  is  selected  according  to 

(16),  the  error  differential  equation  can  be  written  as 

de  = (NA-FN-KC+N)xdt  + Ndw-Kdv 
n _ 

+ Fedt  X-  (NG.-KM.)dv  (19) 

i = l ^ 

The  equation  for  x is 

0,  - 

dx  = Axdt  + dw  + ^ x^G^-dv  (20, 

i = l 

Clearly  x and  e are  both  zero  mean  processes. 

If  (19)  and  (20)  are  put  in  1 equation,  it  is  easy  to  see  how 
the  second  moment  matrix  defined  as 

fn  /4.\  I n nr(C/'4-\wT/,.\l  I + 


X 

X 

-ex(t) 

^ee  ^ ^ 

propagates.  This  is  useful  since  the  performance  measure  (12)  may 
be  written  as 

tf 

J = tr|  J QPgg(t)dt  + SPee(t^)}  (22) 

^0 

If  one  has  the  appropriate  constraint  equation,  the  optimal  selection 
of  F(t)  and  K(t)  may  thus  be  solved  with  deterministic  theory  using  the 
matrix  minimum  principle. 

Equations  (19)  and  (20)  may  be  written  as 
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t 


dx(t) 

' A 

0 1 

X 

dt  + 

dw 

_de(  t)_ 

(NA-PN-KC+N) 

F 

e 

Ndw  - Kdv 

+ X.  I.j  dv 


where 

G. 
1 


NG.  - KM. 
1 1 


The  second  moment  matrix  associated  with  (23)  satisfies  [4], 
P = GP  + PG^  + q + i 


(23) 


(24) 


(25) 


where 


A 

0 

(NA-FN-KC+N) 

F 

(26) 


Q 

qn’’’ 

NQ 

KRk"^  + NQN^ 

(27) 


I 


and 


j = l 


Partitioning  P in  (25)  we  obtain  the  individual  equations. 


(28) 
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P =AP  +PA+Q  +'l/i 

XX  XX  XX  i- 


(29) 


P«p  = [na-fn-kc+n]  p + [na-fn-kc+n 

ee  [_  J xe  ex  L 


+ FPgg  + PggF^  + NQN^  + KRK^  + K J 


- N 'V  - K + N 


(30) 


and 


Ke  = *'■«  * '■xx  (NA-FN-KC*N)^  ♦ 


+ QN^  + - 'i'2K^ 

In  (29),  (30),  and  (31),  the  terms  vj/^,  and  >1^3  are  defined  as 


(31) 


i = l 


P G, 

XX. j 1 


- r T 


G. 

J 


(32) 


j = l 


% = ii 


i = l 
j = l 


P G. 

XXij  1 


- M T 


= M, 


- J 


(33) 


'Vo  = E 


i = l 

j = l 


3 M. 

XXij  1 


- » T 


(34) 


The  term  P.^  is  simply  the  transpose  of  P . Clearly  P can  be  calculated 
independently,  and  can  thus  be  regarded  as  a known  quantity.  The  problem 
is  to  select  K and  F so  that  (22)  is  minimized  subject  to  the  constraints 
imposed  by  (30)  and  (31). 

The  Hamiltonian  for  this  problem  is  then 

(35) 


H = tr  •!  OP  + P ^ P + P i 

" I ' ee  ee  ee  xe  xe  ex  ex  • 


J 
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where  A^g , A^^g,  and  are  Lagrange  multiplier  matrices  associated  with 
Pge » f’xe  ’ ^ex  <^espectively.  The  constraint  equation  for  Pgx  is  in- 
cuded  for  symmetry. 

The  optimal  solution  for  the  gain  K(t)  is  obtained  by  setting  the 
gradient  of  H with  respect  to  K equal  to  zero.  This  leads  to  the  ex- 
pression for  K. 

-1 

^ • \e''[iV  "’ex':"  t N-lx)  ♦ [r  x-P,]  (36) 

where  the  required  inverses  are  assumed  to  exist.  The  Lagrange  multiplier 
matrices  satisfy  the  equations 

- {Q  +AeeF  + Fr\ee|  (37) 

- { (NA-FN-KC+N)''' Agg  + A^A^g  ^ i (38) 

The  matrix  Ag^  is  just  the  transpose  of  Axe-  initial  conditions  for 

(29),  (30),  and  (31)  are 

and 

''xe"o'  ■ Po  Pee^o’  ' " "o>  "o 

The  terminal  values  for  (37)  and  (38)  are  as  required  by  the  transversal ity 
condition  applied  at  the  terminal  time 

-V  (V>  ■ 5 

•'xe 


ee 


.3H 

3P. 


ee 


and 


xe 


=-3H 


xe 


and 
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(41) 

(42) 


Notice  that.\gg(t)  can  be  computed  separately  without  solving  the  rest 
of  the  problem  if  F is  known  beforehand.  However  at  this  point,  we  have 
not  yet  determined  how  F should  be  selected.  It  will  be  seen  that  this 
depends  in  a critical  way  on  the  nature  of  Q.  We  will  consider  two 
different  classes  of  problems. 

CASE  1. 

In  this  case,  we  assume  that  Q = 0.  The  meaning  of  this  is  that 
the  quality  of  the  estimation  algorithm  is  only  important  at  the  terminal 
time.  This  may  make  sense  for  rather  a large  class  of  problems.  The 
reason  that  this  case  is  of  particular  interest  is  that  the  selection 
of  F does  not  affect  the  Hamiltonian,  so  that  we  are  free  to  select  its 
value  based  on  other  considerations. 

Consider  that  part  of  the  Hamiltonian  which  depends  explicitly 

on  F. 

H*  = tr  |FB+B^F^|  (43) 

where 

(Pee'^Pxe)  Aee  " (Pex ‘^Pxx ) 'Ve  (^4) 

From  (39)  and  (40)  it  is  clear  thato(tp)  0.  If  it  can  be  shown  that 
B(t)  = 0 for  all  t in  the  interval  of  interest,  then  a singular  arc 
exists.  The  Hamiltonian  is  independent  of  F.  In  this  case,  one  does 
not  need  to  specify  F to  stay  on  the  singular  arc.  Differentiating 
Ogives 

(45) 


Ci'pi/,  l<().  / Net  Ava<(abi’(’,  Joii'inaC  Uytdcrided 


The  bracketed  term  in  the  above  is  zero  whenever  K is  chosen  optimally, 
i.e.,  according  to  (36).  Hence 


0(t)  = F(t)e(t)  -0(t)F(t)  (46) 

and  (46)  implies  that0(t)  = 0 for  all  t it^  since  f^(tQ)  = 0.  The 
selection  of  F is  thus  not  a performance  factor.  It  may  be  selected 
a-priori  so  thatA^g(t)  can  be  precomputed.  It  may  be  selected  so  as 
to  achieve  some  other  objective  such  as  reduced  sensitivity,  computational 
convenience  or  to  minimize  some  alternative  performance  measure  specifi- 
cally involving  F. 

When  one  thinks  about  it,  the  singularity  with  respect  to  F is 
not  particularly  surprising.  Clearly  two  different  filters  can  even  pro- 
duce the  same  output  at  a particular  time,  given  the  same  input.  What  is 
interesting,  is  that  this  fact  is  generally  overlooked,  and  as  the  example 
problem  will  show,  that  an  alternative  filter  structure  can  be  relatively 
easily  implemented. 

CASE  II. 

In  this  case  the  weighting  matrix,  Q,  is  a positive  definite  sym- 
metric matrix.  When  one  develops  an  expression  for0,  the  result  is 

0=  F0-  0F  +0Q  (47) 

instead  of  (46) , where 

0=  NP  - (48) 

xe  ee  ' ' 

Thus  unless  O is  zero,  a singular  arc  does  not  exist. 
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It  is  easily  seen  thatli(t)  does  not  equal  zero  unless  F is  sleeted 
appropriately.  From  the  initial  conditions,  n(t^)  = 0.  Taking  the 
time  derivative  of  ilwe  get 

(l=  FIl+llF^  + (NA-FN-KC+N)^ 

-K  [rK^- + '1' K^-CP^  1 (49) 

Examining  the  last  equations  we  see  that  if 

(NA-FN-KC+N)  =0  (50) 


then 

il  = Ffl  + llF^  (51) 

This  follows  from  the  fact  that  when  (50)  holds,  A^^(t)  is  zero  for 

all  t in  the  interval.  Consequently  the  expression  for  the  gain  becomes 

-1 

K = [P„CT  * N ^ * '*'3  ] (52) 

and  (52)  is  sufficient  to  have  the  last  term  in  (49)  be  zero.  In  view 
of  (51)  and  the  fact  that  II (to)  is  zero,  it  is  clear  that  II (t)  is 
zero  for  all  te  t^,  t^J  provided  that  (50)  holds  and  that  the  gain  is 
selected  optimally. 

When  12 (t)  is  zero,  it  may  be  seen  that  the  orthogonality  require- 
ment is  met  in  a reduced  state  space,  i.e. 

I2(t)  = N(t)Pxe(t)-Pge(t)  = E | [z  ( t)-e(  t)]e^(  t) } = 0 (53) 

Since  z = z-e,  (53)  may  be  written  as 

E {z(t)  e'^(t)  t = 0 (54) 


( 

> 

i 
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so  that  what  we  have  required  for  singularity  is  that  the  error  and 
the  estimate  be  orthogonal. 


When  N is  the  identity  matrix  and  there  is  no  state  dependent 
noise,  the  result  is  the  Kalman  filter,  with  the  requirement  tbO)  that 


F(t)  = A(t)-K(t)C(t) 


which  of  course  means  that  the  filter  is  of  full  order.  When  the  filter 
is  of  reduced  order,  and  N is  constant,  what  we  have  is  the  observer 


constraint  equation  [l6] 


NA-FN-KC  = 0 


In  general,  when  Q>0,  (50)  is  a necessary  condition  for  a singular  arc. 
Clearly  it  is  not  always  possible  to  select  F to  satisfy  (50).  In  such 
cases,  the  problem  needs  to  be  reformulated  so  that  an  unbounded  F is 
not  indicated.  A1 ternati vely  a suboptimal  solution  can  be  accepted. 

We  will  examine  this  topic  in  the  next  section. 

A necessary  and  sufficient  condition  that  (50)  have  a solution 


F,  is  that 


[nA-KC+nJ  = I^NA-KC+N  Vte|tQ,tfj 


If  (57)  holds  then  a solution  is 


= [nA-KC+n]  n’^+  1'  I-NN^] 


where  N^is  the  pseudo  inverse  of  N and  where  I' is  an  arbitrary  ex? 
matrix  [17] . When  the  matrix  (NN^)  is  nonsingular  then  the  solution 
(58)  can  be  written  as 


F = I^NA-KC+wj  Fnn''^ 


IV.  SPECIFIC  F SOILITIONS 


In  the  preceding  section  we  have  shown  that  when  one  is  only 
interested  in  estimation  at  a particular  time,  the  selection  of  F may 
be  based  on  considerations  other  than  optimality,  so  that  one  may 
pick  it  prior  to  optimization.  Furthermore,  when  Q >0,  it  may  not 
be  possible  to  find  an  F which  results  in  a singular  arc.  In 
that  case  one  may  opt  to  select  F prior  to  optimization.  In  this 
section,  we  will  see  that  when  F is  selected  a priori,  the  two  point 
boundary  value  problem  which  must  be  solved  for  the  selection  of  K 
is  linear,  and  hence  relatively  easy  to  solve. 

Consider  substituting  the  gain  expression  (36)  in  (31)  and  (38). 
The  resulting  expressions  are 


;p  c"*"  + y.,)'*'  A a“^ 
XX  2'  xe  ee 


-(PxxC^+  'Vj(R+vpJ-l 


T T 

+ y'  N + C P 
2 xe 


(60) 


and 


'xe 


= -(NA-FN+N)^  Agg  - Axe  " ^ypl"  + c'  (R+  y,) 


xe 


(P  C^  + y„)^  A + (C  P „ + yj  N^)  A 
XX  2 xe  xe  2 ee 


(61) 


When  F is  known  a priori,  both  P^^  and  Agg  are  known  in  the  sense  that 
they  may  be  precomputed.  The  above  equations  are  then  seen  to  give  a 
linear  TPBVP  in  the  matrices  Pxe  and  A^^g.  The  solution  may  be  obtained 
in  a straight  forward  manner  using  linear  systems  theory,  or  alternatively 
by  assuming  that  the  elements  of  are  linearly  related  to  those  of 
P^g , and  obtaining  a solution  involving  a Riccati  equation.  The  values 
obtained  for  P^g  and  Axe  expression  (36). 
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I 


We  cannot  overemphasize  the  importance  of  the  fact  that  our  result 
is  a linear  TPBVP,  since  it  is  reasonable  to  expect  to  solve  a linear 
matrix  TPBVP.  Often  a nonlinear  matrix  TPBVP  is  so  difficult  to  solve, 
that  the  utility  of  the  result  is  questionable.  We  shall  explain  pro- 
cedures for  solving  a linear  TPBVP  by  looking  at  a particularly  easy 
case  in  which  A is  a scalar  times  the  identity  matrix.  This  results 
when  both  F and  Q are  scalars  times  the  identity  matrix.  When  this  is 
true,  (60)  may  be  written  as 

'^xe  “ *-n*^xe  ^ 

where 

Lii  = A + F - L*  C (63) 

=-L*  (CPxx  + ''^■^ee  (64) 

0^  = Pxx  (NA-FN+N)'''  + Qn"^  + - L* (65) 

and  where 

>-*  = (Pxx^^  + '^2^ 

Equation  (62)  is  of  the  form 

^xe  " ^21  '’xe  '-22  -^xe 

where 

(R+  '*'3)"^  C Age  (68) 

'-22  " ^ 
and 

D3  = -(na-FN+N)^  \gg  + C^(R+  '1^3)-1  ^e  (70) 
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Let  L be  the  matrix 


^11 

1 

CO 

_1 

i 

i 

] . 

L22 

(71) 


and  <Dbe  the  associated  state  transition  matrix  which  can  similarly  be 
partitioned 


'^\l 

(p 

12 

.^21 

4)22  - 

(72) 


Then  the  solution  to  (62)  is 

Pxe(t)  = 'I’ll  (t’  tQ)Pxe(to^  ^‘^12  ''xe^^o^ 

+ f r4>n(t.O  Di(t) 

^0  ^ 

and 

Axe(t)  = '>’21^^’^o)Pxe^^o^  ^ '’’22^^’^o^  ^xe^^o^ 


(73) 


f [4.2i(t,T)  Di(t)  + <D22(t,T)D2(T) 


di 


(74) 


Applying  (74)  at  time  t = t^  gives 

Axe(tf)  = 0 = ‘^21^^f’^o^’'xe^^o)  + ^22(V’^o^  ^e<S^ 

+J'^  j$2i(^f 'i’22^^f ’^^*^2^^^]^^ 


(75) 
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We  can  solve  (75)  for  (to)  and  substitute  the  results  in  (73)  and 
(74)  to  obtain  the  solution  for  all  tGjtQ,  t^J . 

There  is  another  approach  which  is  probably  preferable  in  most  cases. 
We  assume  that  is  linearly  related  to  P^^g  by  the  relationship 


Axg(t)  = u(t)Pj,e(t)  + B(t) 


Differenatiating  (76),  one  obtains  the  differential  equation 


(76) 


^e  - 


Alternatively,  from  (67) 


^ll^xe  ^ ^’'xe  ^ ® 


(77) 


^xe  " ^ ^22  UPxe  + ^-22^  + ^2 


Equating  (77)  and  (78),  we  get  for  U 


U + + ULi2^  = '-21  + L22U 


and  for  B 


(78) 


(79) 


UL12B  + B + UDi  = L22B  + 

Since  A^^g  (t^:)  = 0,  the  terminal  conditions  for  U and  for  B are 


(80) 


U (t^)  = 0 


B (t^)  = 0 


(81) 

(82) 


The  Riccati  equation  (79)  and  equation  (80)  can  be  solved  backwards  in 
time  from  the  above  terminal  conditions.  The  optimal  gain  may  then  be 
expressed  as 

K =rP^g^C^Nf2  + Agg"^  (UPxe+®)^(PxxC^+f2)]['^+'^3]  (®3) 
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and  P)^e  is  evaluated  as 

Pxe  " LjiPxe  ^\2  ['^^xe  ''■6  ■'■ 

The  matrices  U,  and  B must  be  evaluated  off  line,  however,  P^g 

and  K can  be  evaluated  on  line  if  this  is  desired.  Most  likely  these 
would  also  be  evaluated  off  line  and  K stored  for  on  line  calculation 
of  z(t)  using  (9) . 

V.  EXAMPLES 

The  first  example  we  shall  consider  is  of  the  category  discussed  in 
Case  II.  We  assume  that  A(t)  is  zero,  N(t)  = C(t),  and  that  there  exists 
an  F such  that 

FC  = C - KC  Vte  [tg,  tfj  (85) 

then  if  CC^  is  nonsingular 

F = [CC^  - KCC'^J  (CC^)  (86) 

The  filter  equation  is 

dz  = CC^  (CC^)  ^ zdt  + K [dy  - zdt]  (^7) 


The  initial  condition  for  (87)  is 


where  is  the  solution  to 


*^xe  ^ j + (Q+'Vj)  C^-'V2 

Alternatively  since  CP^g  = P^^,  it  may  be  desirable  to  evaluate  (89) 


(90) 


as 


-1 

K(t)  = * C»2]  [r  t ,3] 


(91) 


where  P^^  is  the  solution  to 


Pee(t)  =[cc'^(cJ)-^  - k]  P^^+P^^ 

4.  pnr*T  I »/ni/  T , T „ 


CC^(CC^)'^ 


+ CQC'  + KRk'  + Kf3K^  - 


Kt2^C^  + Cfjc''’ 


(92) 


The  reason  (92)  is  appealing  is  that  Pge  has  fewer  elements  to  calculate 
then  Pxe . 

The  next  example  is  concerned  with  the  very  simple  problem  of 

estimating  a constant  having  zero  mean  and  variance  1 prior  to  observations. 
The  observation  is  of  the  form 


dy  = X dt  + dv  + Mx  dv 


(93) 


where  v and  u are  zero  mean  white  noise  with  covariance  parameter  l 
and  M is  constant.  We  are  interested  in  estimating  the  value  of  x at 
time,  T.  Hence 


J = E i (T)! 


(94) 


and  this  is  a problem  of  the  category  referred  to  as  Case  1. 
putational  convenience  we  select  F = 0.  The  TPBVP  then  is 


For  com- 


xe 


xe 


^^xe  '''  \€ 

= > (Pxe 


xe 
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(95) 

(96) 


with  (0)  = 1 and  (T)  = 0,  where 

Y A (1  + M^)'^ 


The  solution  is 


Pxe(t) 


1 - T(t-T) 
1 + yT 


and 


A (t) 
xe  ' 


(t  -T) 


1 + yT 


(97) 


(98) 


(99) 


Interestingly,  because  of  the  complimentary  nature  of  Pxe^^^  Axe(t), 
the  gain  is  a constant. 


y 

k(t)  = k = 

1 + yT 


(100) 


The  filter  is  simply 


dx(t)  = 


1 + yT 


dy(t) 


and  the  error  variance  at  time  t = T is 


P (T) 

ee'  ' 


1 + M 


1 + YT  1 + + T 


(101) 


(102) 


The  filter  (101)  is  simpler  to  construct  than  the  choice  which  would 
require  F = -K,  i.e.,  one  of  the  form 

dx(t)  = k(t)  |dy(t)  - x(t)dt]  (103) 
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even  though  ii  ■ obviously  a full  order  filter.  The  authors  feel  that 
the  nonunique  property  of  optimal  linear  filters  for  certain  cases  is 
a feature  which  one  should  take  advantage  of. 

VI.  REMARKS  AND  CONCLUSIONS 

We  have  extended  the  results  of  [8]  to  problems  having  state  de- 
pendent noise  in  the  observation  and  dynamical  equation.  Control 
theoretic  methods  have  been  used  to  solve  the  problem,  and  optimal 
solutions  have  been  shown  to  correspond  to  singular  arcs.  Different 
solutions  result  when  there  is  an  integral  performance  measure  than 
when  only  estimation  at  the  terminal  time  is  important.  In  some  cases, 
we  have  seen  that  it  makes  sense  to  select  the  filter  matrix  ahead  of 
time  and  then  optimize  the  gain.  The  computational  algorithms  associated 
with  such  prior  selection  are  particularly  convenient.  There  are  no 
terribly  difficult  TPBVP's  in  this  approach  and  that  is  why  the  authors 
feel  that  it  is  practical  and  useful,  both  for  full  order  and  reduced 
order  filters.  The  amount  of  off  line  calculation  necessary  to  simplify 
on  line  filtering  appears  to  be  quite  realistic. 
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ABSTRACT 


The  linear  reduced  order  filtering  problem  is  formulated  as  a matrix  two- 
point  boundary-value  problem.  Cases  are  presented  in  which  the  two-point 
boundary  value  problem  simplifies,  and  may  be  solved  with  a reasonable  amount 
of  calculation. 
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INTRODUCTION 

i 

[ In  this  paper  we  consider  estimating  only  a part  of  a state  vector,  using 

' a reduced  order  linear  filter.  The  problem  is  motivated  by  the  fact  that 

sometimes  one  is  only  interested  in  estimating  a portion  of  the  state  vector, 

I while  the  entire  state  vector  may  be  of  large  dimension.  For  example,  the 

i 

i'  entire  state  vector  might  contain  state  variables  associated  with  a detailed 

model  of  the  noise  generating  process,  and  the  filter  designer  might  only  be 
interested  in  estimating  position  and  velocity.  The  complexity  of  the  Kalman 
filter  for  estimating  a large  state  vector  leads  one  to  consider  less  compli- 
cated reduced  order  filter  designs  for  estimating  the  variables  of  interest. 

The  reduced  order  filter  would  typically  have  fewer  on  line  calculations,  but 
a greater  number  of  off-line  calculations  required  for  evaluating  the  filter 
parameters . 

There  has  been  considerable  research  in  the  area  of  reduced  order  filter- 
ing [1-8]  because  of  the  difficulties  involved  in  implementing  a full  order 
filter  of  large  dimension.  This  paper  is  a discrete  version  of  the  work 
presented  in  [8],  wherein  a continuous  time  problem  was  formulated  as  a two- 
point  boundary-value  problem  (TPBVP).  The  discrete  problem  is  quite  different 
than  the  continuous  time  problem,  since  some  of  the  filter  variables  to  be 
optimized  appear  quadratical ly  in  the  discrete  problem  and  only  linearly  in  the 
continuous  problem.  This  paper  also  represents  an  extension  of  [5],  where  a 
discrete  reduced  order  filtering  problem  was  solved,  but  the  parameters  of  the 
reduced  order  filter  were  only  optimized  with  respect  to  performance  at  the 
next  stage.  Here  parameters  are  optimized  over  an  interval.  A solution  is 


t 


obtained  using  the  matrix  form  of  the  minimum  principle  [9].  As  in  [8],  a 
suboptimal  approach  is  suggested  which  leads  to  a linear  TPBVP  to  solve,  and  a 
method  of  solution  is  suggested. 


I 


PROBLEM  STATEMENT 


Consider  a linear  stochastic  process  defined  by  the  discrete  state 
equation 

X =Ax+w;j=0,l,...  (1) 

j + 1 j j j 

where  x-  is  the  state  vector  at  stage  j,  and  w.  is  a zero  mean  white  noise 
*3  J 

vector.  The  system  is  observed  with  noisy  linear  measurements 
yj  = ""j  ^ ''j  ’ j = ••• 


where  y.  is  the  observation  and  v.  is  zero  mean  white  noise.  The  covariance 
J J 

matrices  for  w-  and  v.  are 
J J 


E 

E 


{ w^  w k } = Q . 6 


j k j 


f V ■ V , } = R . iS  , . 

J k J k J 


(3) 


The  noije  terms,  w ; and  v . are  not  correlated  with  each  other  or  with  the 
J J 

initial  condition,  . In  the  problem  considered,  a lower  order  linear 
transformation  of  the  state  vector  is  of  interest 


z.  = N . X . (4) 

J J J 


The  dimensions  of  z^  , x . , and  y . are  5,  n, 
J J J 

Since  z-  is  lower  dimensioned  than  x . , 
J J 

a reduced  order  filter  of  the  form 


and  ni  respectively,  and  f ^ n. 
we  consider  estimating  it  with 


z 


+ 1 


+ K 


j + 1 


(5) 


1 

I 

J 

j 


The  objective  of  the  problem  is  to  select  and  to  satisfy  a require- 
ment that  the  estimate  be  unbiased,  i.e., 


E{e.  1 = Efz^  "Z-il  = 0 
j J J ' 

The  matrices  F.  and  K . are  then  subject  to  optimization  to  minimize  a 
'J  J 


quadratic  performance  measure. 


i»i- 1 y J 

J = E{5:e.  U e.+e  Se  } 
j=o  J j j rn  m 


The  solution  to  the  unbiased  requirement  is  easily  obtained  by  considering 
the  error  difference  equation.  The  optimization  problem  can  then  be  solved 
using  the  matrix  minimum  principle. 


THE  UNBIASED  REQUIREMENT 


By  considering  (1),  (2),  (4),  and  (5),  we  find  that  the  difference 
equation  for  the  error  is 

'jM  -[<"3*1-^  S-.l>*3-  ^3  % * 

(N.  -K.C.  )w  +F  e -K  V -g 

J + 1 J J + 1 j j j j j + 1 j 


Taking  the  expectation  of  both  sides  of  (8)  gives 


F . N . ] M . - q . 

J J J J 


where  uj  ^E  1 Xj  ).  From  (9)  it  is  clear  that  we  will  have  a linear  homo- 
geneous equation  for  the  prior  expectation  of  the  error  provided  that  g is 

J 


3 


selected  as 


g.  =[(N.^,-  K.C.  ,)A.-  F.N.]m- 
‘■'j  + 1 J J + l j J J J 

Hence  E { e . } = 0 V . > o if  E { e „ 1 = 0,  which  will  be  true  if  z is 
J J - 0 0 

chosen  as 


z = N 11 
0 0 0 

With  gj  selected  as  in  (10),  the  difference  equation  for  the  error  is 

e-.i=G.x’.  + D.W.  + F e -K  v. 

J 1 J J J J j j j J + 1 

where  we  have  defined 

G . A C(  N - K . C ) A - F . N ] 

J-  J jJ 

DjA(Nj  + j-  KjCj  + j) 

"x  . AX  - y . 

J - j J 

It  is  clear  that  3<  ^ is  a zero  mean  process  satisfying  - 
’'■j*!  ’ ""j 

The  remaining  problem  is  to  minimize  the  error  criterion  (7),  with  respect 
to  Fj  and  Kj  , subject  to  the  constraints  imposed  by  (12)  and  (14). 

OPTIMIZATION 


The  performance  measure  (7)  may  be  written  in  terms  of  the  error 
variance  matrix 


J=trf7;  U P (j)+SP  (M)l 
j = 0 J e e 
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where 


Ppp(j)AEIe.e.'} 


From  (12)  and  (14)  it  is  seen  that  *he  error  variance  matrix  satisfies 

P e e ( J + 1 ^ j % e ^ J ^ ^ j ^ ^ ^ j % x ^ ^ ^ ^ j ^ ^ ^ j x e ^ J ^ j ^ ^ 

(1 

G.P  (j)G.^  + D.Q.D-'^+K.R.,K.‘'’ 

Jxx'-^'j  JJ  + lj 


where 


Pvp  (j)  = Pex  (j)A  EfXje' j} 


propagates  according  to  the  equation 


P,e(j*U  =AjP,^(j)  FjT  *AjP,,(J)Gj  MjD.Td,) 

From  ( 14) , the  matrix  satisfies 

J 'J 

The  initial  conditions  for  (17),  (19),  and  (20)  are 


P^^(o)  =E{XpXQ  } = Var  { X (o)  ; = P 0 


^’ee^®^  ^o^o^o  ’ ^xe^°^  o ^ o 


The  problem  is  now  completely  within  a deterministic  frame  and  can  be 
solved  using  the  matrix  minimum  principle.  The  Hamiltonian  is  of  the  form 

Hj=  tr  tU3Pgg(j)  + Pee^j-"!^  ^ ^ee^j^  + P e x ^ ^ e x ^ (j  + 

T ( 

+ Pxe  (j  ^ ^xe  (J  + 1)  ' 

where  App  and  ^ are  Lagrange  multipliers,  and  A is  the  transpose  of 


\ 

t 

i 

I 


After  substituting  from  (17)  and  (19),  the  costate  equations  can  be 
found  from  (22). 

Age  (j)  =Uij_  = Uj  + F j (j  + 1)  F j (23) 

^Pee(j) 

and 


^ P X e(  j ) 


The  terminal  conditions  for  (23)  and  (24)  are 


A (M)  = S 
e e 


(24) 


(25) 


and 

A^g  (M)  = 0 (26) 

Setting  the  gradient  of  the  Hamiltonian  with  respect  to  = o gives  a 
necessary  condition  for  optimality. 


Similarly,  setting  the  gradient  with  respect  to  Fj  equal  to  zero  gives 

[%e»>-%x<jl"j"-"/xe'')>  * Nj''xx(o)Nj 
• [«ex(j*‘>  *»ee  (J*»  j +1  - S j * 1 " j 1^'’ x x < j>  S 


(27) 


(28) 

(j)] 


Thus  we  have  the  TPBVP  giving  the  necessary  conditions  for  an  optimum.  In 


i 


general,  such  a problem  is  difficult  to  solve.  There  are  two  cases  in  which 
it  is  not  unreasonably  difficult  to  solve  the  problem.  In  the  first  case,  the 
TPBVP  simplifies  into  a single  point  boundary  value  problem.  The  Kalman 
filter  is  of  this  category.  In  the  second  case,  F ^ , is  not  optimized,  but 
selected  prior  to  optimization.  The  optimization  of  the  gain  Kw  may  then  be 
accomplished  by  solving  a linear  TPBVP  which  is  a routine  procedure. 


SIMPLIFICATION 


In  this  section  we  show  that  there  are  circumstances  where  the  TPBVP 
simplifies  considerably.  Assume  that  it  is  possible  to  select  F^  in  such  a 
way  that  Gj  is  zero,  i.e. 


("jM  =0 


(29) 


Then  from  (24)  and  (26)  it  is  clear  that  (j)  is  zero.  If  Agg  is  non- 
singular, and  if  furthermore 


(j)  APgg  (j)  - Pgx  (j)  N j ' = 0 


(30) 


then  (28)  can  be  written  as 

so  it  is  clear  that  (29)  implies  that  F^  satisfies  the  necessary  conditions 
for  an  optimum.  It  really  isn't  necessary  that  be  nonsingular  for  (29) 

to  imply  (28),  but  it  is  critical  that  n(j)  be  zero,  so  we  investigate  this 
point . 

Using  (17)  and  (19),  and  noting  that  is  zero,  we  obtain 


(31) 
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-Nj.iQjCj,/-  FjP^^(i)  a7c.,,T]k/ 

We  will  show  that  the  bracketed  term  in  (32)  is  zero  when  is  selected 
optimally.  With  Gj  equal  zero,  (27)  will  be  satisfied  if 

K,-  [ C . , , P (j+l)C.,/+R.  ,]  = [ F.  P ^ (j)  A .^  + N . ^ P (j  + 1) 

J j + 1 XX  j + 1 J + 1 J ex''"  j j + 1 xx"^ 

-F  N .P  (j)A  ^]C.  / (33) 

J J XX  j J ^ ^ 

We  substitute  for  Fj  N ^ from  (29)  in  the  last  term  of  (33),  obtaining 

Substituting  from  (20)  for  P (j  + 1),  and  simplifying  the  result  gives 

XX 

K.[c.  ,Q.C.  ^ + R.  ,]=[F.P  (j)A.^  + N.  ,Q.]C.^/ 

3 J + 1 J J+1  J + 1 J ex  J J + 1 J J + 1 

(35) 

which  insures  that  (32)  may  be  written  as  a homogeneous  equation 

u (j  + 1)  = F . fi(j)  F . ^ (36) 

whenever  is  selected  optimally.  From  the  initial  conditions  of  the 
problem,  n(o)  = 0,  so  (36)  implies  that  q(j)  = 0 for  j>o.  We  therefore 
have  established  that  (29)  and  (35)  will  satisfy  (27)  and  (28),  the  necessary 
conditions  for  optimality.  We  can  write  (29)  and  (35)  as 


Ai^C  . / 

J J + 1 


(S  + l^j^j  + /^^+l^ 


a"*^  N . ^ 

j J + 1 


-C.  .A  .P  (j)  fJ  C.  ,Q  .N 

j + ljxe  JLjJLj  + ljj  + 1 
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K . ' 

L . J = B . 
J J 

f.t 

L J J 


where  Lj  and  Bj  are  defined  as  the  corresponding  matrices  in  (37).  A 
necessary  and  sufficient  condition  for  (38)  to  have  a solution  is  that 


L . L . B . = B . 

J J J J 

t 

where  L.  is  the  pseudo  inverse  of  L..  If  a solution  exists  it  is  of  the 

J 


where  r is  an  arbitrary  matrix. 

It  is  of  interest  to  relate  our  solution  to  the  well  known  Kalman 
filter  solution  when  the  filter  is  not  of  reduced  order,  but  Nj=I.  In 
this  case  (29)  requires  that  Fj  be  selected  as 

which  is  the  same  as  the  Kalman  filter  result.  Using  (41)  in  (35)  we  have 

Noting  that  Po„( j ) - Py.( j ) . snd  defining  the  one  stage  prediction  error 

cG  XG 

variance  matrix  as 


Pee(j*l|j)=A,P^^(J)A.'+Q^ 


I 
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It  is  clear  that  (43)  gives  the  common  expression  for  the  Kalman  gain 


e e 


(j  + llj) 


C.  ' [ C. 

J + 1 J + 1 


e e 


( j + 1|  j: 


R.  ] 

J + 1 


-1 


(44) 


Since  in  the  reduced  order  case,  we  may  not  be  able  to  solve  (40),  or  more 
fundamentally,  may  not  be  able  to  solve  the  TPBVP  specified  in  the  previous 
section,  we  therefore  investigate  a partial  solution  where  Fj  is  selected 
a priori,  and  only  K.  is  optimized. 

J I 

SPECIFIED  F.  SOLUTIONS 
J 

The  idea  that  we  suggest  in  this  section  is  motivated  by  the  fact  that 
if  we  select  Fj  prior  to  optimization,  the  remaining  TPBVP  is  linear,  and  there 
are  a number  of  approaches  available  for  solving  linear  TPBVP's.  If  a solution 
exists,  it  can  be  obtained  in  a predictable  number  of  steps,  depending  on  the 
procedure  used.  Thus  a reasonable  amount  of  off-line  calculation  can  bring 
about  a large  savings  in  on-line  calculations.  The  performance  loss  can  be 
made  small  by  selecting  Fj  in  some  way  which  relates  to  performance,  but  does 
not  require  solving  a TPBVP.  Such  a procedure  was  indicated  in  '5]  where  only 
single  stage  optimization  was  considered.  We  should  also  point  out  that  if  one 

is  only  interested  in  the  estimate  at  the  terminal  time,  the  selection  of  F. 

J 

does  not  affect  the  performance  measure.  This  was  proved  in  [8]  for  continuous 
time  problems.  Hence  the  method  we  propose  here  is  appropriate  for  that 
special  case,  i.e.  the  actual  optimal  performance  will  be  achieved. 

If  F.  is  selected  a priori,  we  note  from  (23)  that  ^ may  be  regarded  as 
J ee 

a known  quantity,  as  is  P^^,  in  the  sense  that  it  may  be  precomputed  without 
regard  for  the  remainder  of  the  problem.  If  (j)  is  nonsingular,  one  may 
solve  for  the  gain  Kj  as 
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Kj  .[a;*  (jM)A^^(J*l)P„(j*l)  * Fj(P^^(j)-NjP,^(j)|A]  (45) 

where  we  have  also  assumed  that  [C . ^ , P ( j + 1)  C . , + R . ] has  an 

J + 1 XX  j + 1 j +1 

inverse.  Substituting  (45)  in  (19)  and  (24)  we  obtain  the  expressions 

(j  + i)  = (i-P,  (j+i)c.  /m.  /)  a p (j)  fJ  + D (j) 

xe  XX  J +1  J + 1 J xe  J 1 (45) 

.Tm  Td  f a j.  ^ \ ^ (j  + 1) 


-Pxx  (j+1)  Ca+i'  M ' P (j  + 1)  A (j  + 1) 
J J + 1 XX  xe 


ee 


Axe(j)-«j'  j P^JJAl))  A^^(jAl)  Fj  AD,  (j) 

-V  Sa/ 


(47) 


where 


0j(J)-  (I  - P„(iAi)  c.^/h.^jF;  (JaDDj^j 


(48) 


A.  P (j)  nJ  F J) 

J XX  J J 


D,(J).[(«j,jA.  -F.  N,)F.a;  (P__^  «A„„.^; 

-''0  ’’xx  <j>  "/  '/»  *ee 


(49) 


and 


M AC.  / (C.  , 1 P (j  + 1)  C.  ^ + R.  ^ ,) 
j + 1 - J + 1 J + 1 XX  J+1  J + 1 


-1 


(50) 


The  boundary  conditions  for  (46)  and  (47)  are  specified  by  (21)  and  (26) 

respectively.  Equations  (46)  and  (47)  are  linear  in  the  matrices  to  be 

computed,  P and  A . Therefore  there  are  a number  of  ways  to  solve  the 
xe  xe 

problem.  One  approach  is  to  assume  a linear  relationship  between  the  elements 


r 


of  P and  A , i.e.  after  putting  the  elements  of  A in  a long  vector  A^^, 
xe  xe  r j xe 

★ 

and  similarly  putting  the  elements  of  in  a vector  P^^  , we  may  assume  that 
"’xe  ‘J)  ■ =*J  ’’xe*  * '‘i 

★ -k 

Recursive  equations  may  then  be  obtained  for  5.  and  6 •,  and  the  TPBVP  is 

J J 

effectively  transformed  into  a single  point  boundary  value  problem. 

We  illustrate  the  above  remarks  by  examining  a special  case  where  it 
is  not  required  to  first  put  the  elements  in  a vector  format.  It  is  assumed 
that  the  matrices  of  (23)  are  all  in  a scalar  form,  i.e. 


F . = f.  I 
J J 

^ee  ^ ~ ^ ee  ^ ^ 

which  requires  that  S and  L)  be  similarly  defined.  Then  (46)  and  (47)  are 
of  the  form 

P (j  + 1)  =0111  (j)  P^  (j)  + a,,  (j)  ^ yp  (j  + 1)  + D,  (j) 

xe  xe  12  xe  i 


^xe  ^ '"21  ^xe  “22  ^xe  °2 


where 


'■11  ''j 

»12  <■**'>  ’’xx  ’’xx 

“21  »ee 

“22  »> 
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A linear  relationship  between  P and  A is  assumed,  i.e. 

xe  xe 


Axe  (j)  = (j)  + 


Substituting  (57)  in  (54)  and  (55)  (jives 


Pxe  + = “11  (j)  ^xe  ^ “12  + ^ ^ “l2 

+ (j) 


fj  Pxe  (j)  + = a2l  (j)  Pxe  ^ “22  (j)  + 1 ^xe 


(57) 


(58) 


(59) 


+ «22  ^ + 1 ^ ^2  ^ 


Solving  for  (j  + 1)  in  (58),  and  substituting  that  result  in  (59)  gives 

■ j ^e  ^ " “21  Pxe  ^ “22  + 1 ^ ^2 

.1  (60) 
'■“22  (j)  "jfi  - “12  =j+l^  1^“11  Pxe  ^“12  +1 

+ Dj  (j)  ] 


If  the  above  is  to  hold  for  arbitrary  P^^,  then  we  must  have 

-1 

Sj  = «2i  (J^  ■'■  “22  --!  4-  1 ' “lo  “,•  x J “ii  (j) 


■j  + 1 


12  “j  + r 
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(61) 


and 


fi.=  <>22  (j)  [I  + =j  + i (I  - “i2  “j  + 1^  ^ “12  + 1 ^ °2  (j) 

'*’“22  'j  + 1 ■ “l2  “j+1^ 

The  terminal  conditions  for  the  above  are 


" M = 0 ; e f,,  - 0 


(62) 


(63) 


13 


Having  solved  backwards  in  time  to  evaluate  .}and{6.},  one  may  then  solve 

J J 

(58)  forward  from  the  initial  condition  specified  by  (21).  Since  is 

known  in  terms  of  P , (45)  may  then  be  used  to  evaluate  the  optimum  gain 
xe 

sequence.  We  note  that  the  algorithm  we  have  suggested  will  fail  if  any  of 
the  required  matrix  inverses  fail  to  exist  any  where  along  the  trajectory. 


EXAMPLE  PROBLEM 


In  this  section  we  demonstrate  the  performance  of  a filter  designed  with 
a specified  set  of  values  for  The  example  is  a simplified  model  of  a 

discretized  inertial  system  as  treated  in  [8].  The  dynamics  are  modeled  as 


^j  + 1 


x . + w . 
3 J 


where  A is  .02,  and  the  observation  model  is 


y-,,=[I  0]x.  +v. 

J + 1 J + 1 j + 1 


The  covariance  matrix  of  Wj  is  assumed  to  be 


' "j  “k  ' ' 0 1 ‘ji* 


and  Vj  is  white  noise  with  covariance  parameter  .1.  The  initial  variance  of 
the  state  vector  is  assumed  to  be  the  identity  matrix.  It  is  desired  to 
estimate  x^  using  a first  order  filter  of  the  form 

"j  t 1 = * *=1  M I'j  * 1 * (f 


From  (10),  g^  must  be  selected  as 
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1 

0 


A 

1 


j 


u 

0 


(68) 


The  filter  parameters  F-  and  K.  are  selected  in  the  following  way.  First  a 

J J 

single  stage  optimization  procedure  is  used  to  obtain  values  for  F.  and  K.  as 

J 

indicated  in  [5].  This  does  not  require  solving  a TPBVP.  The  performance  of 
the  resulting  filter  is  shown  in  Figure  1.  The  value  of  F-  is  maintained,  but 

J 

K.  is  optimized  according  to  the  method  proposed  in  this  paper,  i.e.  by  solving 

>J 

a linear  TPBVP  to  minimize  the  performance  measure 


M-1  ? ? 

J = E { .i:  e2  + } 

j=o 


(69) 


The  results  are  shown  in  Figure  1 where  they  are  compared  with  Kalman  filter 
results  and  single  stage  optimization  results.  It  is  seen  that  performance 
can  be  improved,  relative  to  that  obtained  in  [5],  while  maintaining  the  same 
amount  of  on  line  calculations.  The  increased  number  of  off  line  calculations 
is  within  reason  since  only  a linear  TPBVP  must  be  solved. 

REMARKS  AND  CONCLUSIONS 

In  this  paper  we  have  investigated  the  discrete  version  of  the  reduced 
order  filtering  problem.  The  solution  to  the  general  problem  is  seen  to  be  a 
nonlinear  matrix  TPBVP,  and  hence  of  limited  usefulness.  We  have  shown, 
however,  that  under  certain  conditions  it  is  possible  to  find  a simplified 
solution  to  the  TPBVP,  and  that  this  may  be  of  considerably  less  complexity 
than  the  Kalman  filter.  Another  approach  has  been  presented  where  only  the 
gain  of  the  reduced  order  filter  is  optimally  selected,  the  other  parameters 
having  been  selected  a priori,  either  by  means  of  a single  stage  optimization 
procedure  as  in  the  example  problem,  or  by  some  other  method.  The  motivation 
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behind  this  approach  was  that  a linear  TPBVP  resulted,  which  could  be  solved 
with  a predictable  amount  of  calculation.  The  reduced  order  filters  described 
in  this  paper  are  suited  to  those  situations  where  it  is  important  to  reduce 
the  number  of  on  line  calculations.  The  filter  gains  would  ordinarily  be 
precomputed  and  stored  for  use  in  filtering  the  data. 
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ABSTRACT 

The  linear  reduced  order  filtering  problem  is  formulated  as  a matrix 
two-point  boundary-value  problem  for  systems  with  state  dependent  noise  in 
both  the  dynamic  and  observation  models.  Cases  are  presented  in  which  the 
two-point  boundary-value  problem  simplifies. 
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INTRODUCTION 


U 


Many  systems  are  more  realistically  modeled  as  having  multiplicative 
noise  instead  of  additive  noise.  One  example  occurs  in  the  momentum  exchange 
method  for  regulating  the  angular  procession  of  a rotating  space  craft  [1]. 
There  is  a disturbance  which  depends  on  the  procession  rates.  Another  example 
occurs  in  the  design  of  phase  lock  loops.  The  phase  instability  of  an  oscil- 
lator described  in  rectangular  coordinates  appears  as  white,  state  dependent 
noise  [2].  Multiplicative  noise  also  appears  naturally  in  system  identifica- 
tion problems  [3].  Because  such  problems  are  of  considerable  importance,  there 
has  been  much  research  dealing  with  the  topic  of  filtering  and  control  for 
systems  with  state  dependent  noise  [1-5]. 

If  the  order  of  the  system  is  large,  the  design  of  a corresponding 
filter  of  large  dimension  is  often  problematic  from  the  viewpoint  of  on-line 
computation.  Consequently  there  have  been  many  papers  written  in  the  area  of 
reduced  order  filter  design  [5-9].  In  situations  where  one  is  only  interested 
in  estimating  a lower  order  linear  transformation  of  the  state  vector,  it  is 
reasonable  to  attempt  to  do  this  with  a filter  of  reduced  order.  The  para- 
meters of  the  filter  may  then  be  selected  using  fixed  configuration  optimiza- 
tion methods  [8-13].  The  structure  of  such  a filter  may  then  be  suboptimal , 
but  the  parameters  are  chosen  optimally,  subject  to  the  structural  constraints. 

In  this  paper  we  consider  a discrete  version  of  the  work  presented  in 
[5],  where  a continuous  time  problem  was  formulated  as  a two-point  boundary- 
value  problem  (TPBVP).  The  discrete  problem  is  quite  different  from  the 
continuous  time  problem,  since  some  of  the  filter  variables  to  be  optimized 
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appear  quadratical ly  in  the  discrete  problem  and  only  1 inearly  in  the  continu- 
ous time  problem.  The  problem  considered  here  is  similar  to  that  considered 
in  [13],  however  in  [13]  only  a single  stage  optimization  was  performed,  and 
an  observer  structure  was  required.  Here  the  matrix  minimum  principle  [14] 
is  used  and  optimization  is  performed  over  an  interval.  The  observer  structure 
is  not  required  a-priori,  as  we  allow  a driving  term  in  the  filter  to  remove 
any  a-priori  bias.  Cases  are  presented  where  the  TPBVP  simplifies  considerably. 
In  one  case  only  a single-point  boundary-value  problem  must  oe  solved.  In 
another  case,  a linear  TPBVP  is  obtained  which  can  be  solved  either  by  direct 
use  of  linear  systems  theory  or  by  a Riccati  equation  technique. 

PROBLEM  STATEMENT 

A stochastic  process  is  considered  which  is  modeled  by  a discrete 
equation  with  state  dependent  noise 

x(j  + l)=A(j)x(j)+w(j)  + i X.  (j)r.(j)n(j)  (1) 

i = l ^ ^ 

where  x(j)  is  the  state  vector  at  stage  j,  and  w(j)  and  n(j)  are  zero  mean 
white  noise  vectors.  The  term  x.,-  (j)  is  defined  as 

Xi  (j)  AX.  (j)  - (j)  = X (j)  - E{Xi  (j)}  (2) 

The  observation  model  is  of  the  form 

n _ 

y (j  + 1)  = C (j  + 1)  X (j  + 1)  + V (j  + 1) + 1:;  x.j  (j)  M.  (j)  n(j)  (3) 

i = l 

where  v (j  + 1)  is  zero  mean  additive  white  measurement  noise.  We  note  that 
measurements  of  the  form 

n _ 

y ( j + 1)=  C ( j + 1)  X ( j + 1)  + V ( j +1)  + i Xi  (j  + 1)  Mi  (j +1)  E(j  + 1)  (4) 

1=1 
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T 


where r is  zero  mean  white  noise  can  be  put  in  the  required  form  (3),  if  n and 
are  properly  defined,  and  if  it  is  also  true  that  when  (j  + 1)  multiples 
a non-zero  term  in  the  summation  of  (4),  then  its  dynamical  representation  (1) 
contains  no  noisy  term.  The  covariance  matrices  for  w{j),  v(j),  and  n(j)  are 

E{w(j)w'^(k)}  = Q(j) 

E{v  (j)  vT(k)}  = R(j)  (5) 

Ein  (j)  nT(k)}  = H (j)  6., 

Jk 


These  terms  are  not  correlated  with  each  other  or  with  the  initial  condition 
Xq.  The  known  statistics  of  the  initial  condition  are 


E{  X (o) } = M 

0 

Var{  X (o) } = Pq 


(6) 


In  the  problem  considered,  a lower  order  linear  transformation  of  the  state 
vector  is  considered 


z (j)  = N (j)  X (j)  (7) 

The  dimensions  of  z(j),  x(j),  and  y(j)  are  z,  n,  and  m respectively,  where 
)l  .<  n. 

Since  z(j)  is  of  lower  dimension  than  x(j),  we  consider  estimating  it 
with  a reduced  order  filter  of  the  form 


z(j  + l)  = F{j)z{j)  + K(j)y(j  + l)+g(j)  (8) 

One  objective  of  the  problem  is  to  select  g (j)  and  z (o)  to  satisfy  a require- 
ment that  the  estimate  be  unbiased,  i.e. 


5 

E {e  (j)  } = E {z  (j)  - z (j)}  = o;  '^j>o  (9) 

We  note  that  the  expectations  are  prior  expectations  and  we  are  not  requiring 
that  the  estimate  be  conditionally  unbiased,  which  is  a much  stronger  require- 
ment. The  matrices  F (j)  and  K(j)  are  then  selected  to  minimize  a quadratic 
performance  measure 
N*  1 

J = E(l‘  e’’’ (j)  u{j)  e (j) +e^  (N*)  Se  (N*)  } (10) 

j=o 

The  solution  to  the  unbiased  requirement  is  obtained  by  considering  the  error 
difference  equation.  The  solution  to  the  optimization  problem  is  solved  using 
the  matrix  minimum  principle. 

THE  UNBIftSEO  REQUIREMENT 

The  error  difference  equation  can  be  shown  to  be 

e (j  + 1)  = G (j)  X (j) + F (j)  e (j)  + D (j)  w (j)  - K (j)  v (j  + 1) - g (j) 

n n _ (11 ) 

+ D(j)l  X.  (j)  r (j)  n(j)  . K (j)l  X.  (j)  M.  (j)  n(j) 

1=1  ’ 1 1=1  ^ ^ 

where  we  have  defined  the  matrices 

G(j)=  [N  (j +1)  - K (j)  C (j +1)]  A (j)  - F (j)  N (j)  (12) 

and 

D(j)=  N(j  + 1)-  K(j)  C(j+1)  (13) 

From  (11)  it  is  clear  that  if  g (j)  is  selected  as 

g (j)  = G (j)  „(j)  (14) 


and  if 
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Ho)  = N (o)  , 

then 

E {e  ( j)  I = 0 V-  j >0  (16) 

With  g (j)  selected  according  to  (14),  the  error  difference  equation  may  be 
written  as  ^ 

* 

e (j  + 1)  = G (j)  X (j)  + F (j)  e (j)  + D (j)  w (j)  - K (j)  v (j  + 1) 

n (17) 

+ V X.  (j)  B.  (j)  n(j) 
i = l ^ ^ 

where 

(j)  AD  (j)  r.  (j)  - K(j)M.  (j)  (18) 

and 

X (j)  = X (j)  - u(j)  = X (j)  - E {X  (j)}  (19) 

to  be  consistent  with  (2).  It  is  clear  that  )<  (j)  is  a zero  mean  process 
satisfying 

X (j  + 1)  = A (j)  X (j)  + w (j)  + F.  (j)  r - (j)  n(j)  (20) 

1=1  ' ' 


In  order  to  solve  the  remaining  problem,  it  is  necessary  to  select  K(j) 
and  F (j)  to  minimize  J as  indicated  by  (10)  subject  to  the  constraints 
imposed  by  (17)  and  (20). 
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THE  OPTIMIZATION  PROBLEM 

The  performance  measure  (10)  may  be  written  in  terms  of  the  error 
variance  matrix  as 

★ 

N-1 

J = tr{  1 U(j)  P^g(j)  + SPgg  (N  ))  (21) 

j=o 


where 

- ^ (j)  (j)} 


(22) 


It  is  possible  to  set  up  the  optimization  problem  completely  within  a 
deterministic  framework  if  one  can  find  a set  of  equations  describing  the 
propagation  of  P^^.  We  first  define  the  matrices 

Pxe(j)  = Pex(j)^E{7(j)e^(j)}  (23) 


and 

P^^(j)  AE  {X  (j)  x'^  (j)}  (24) 

Then  from  Equations  (17)  and  (20)  it  can  be  seen  that  the  above  matrices 
satisfy  the  equations 

Pee  (j  + 1)  = F (j)  Pgg  (j)  (j)  + G (j)  P^^  (j)  (j)  + G (j)  Pjj)  (j) 

+ F(j)  Pg^  (j)  G^(j)  + D(j)  Q(j)D'^(j)  + K(j)  R(j  + 1)  K'^(j) 
+ D (j)  I'  (j)  (j)  + K (j)  (j)  (j)  - K (j)  'i'2  (j)  (J) 

- D(j)s'J(j)  k'^(3)  (25) 
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P^g(j  + 1)  = A(j)P^^(j)  G'^(j)  + A(j)  P^g(j)  F^(j)  + Q(j)  D'^(j) 


+ ^<{0)  D^{j)  - ^J(j)  k"^  (j) 


and 


Pxx(j  + 1)  = A (j)  Pxx  (j)  a’’’  (j)  + Q (j)  + t (j) 


where  we  have  used  the  definitions 


n 


E r.  (j)  E(j)  rj(j)  Pxx.^(j) 


i , k=l 
n 


'I'l  (J)  A Z M . (j)  5(j)  (j)  Pxx.,  (j) 

i,k=l  ^ K XX,  I, 


h’k 


and 


Z (j)  H (j)  (j)  Pxx.  (j) 

i,  k=l 

The  initial  conditions  for  (25),  (26),  and  (27)  are 


Pxx 


= P. 


p (o) 
xe  ' ’ 


P 

0 0 


p (o) 
ee  ^ ' 


K P. 

0 0 0 


(26) 


(27) 


(28) 

(29) 


(30) 


(31) 


The  problem  may  be  stated  using  only  deterministic  equations.  It  is  desired 
to  choose  K(j)  and  F (j)  to  minimize  J as  indicated  by  (21)  subject  to  the 
constraints  imposed  by  (25),  (26),  and  (27).  Actually  Px^^j) 
computed  and  considered  a known  sequence  of  matrices.  The  remaining  part 
of  the  problem  is  solved  using  the  matrix  minimum  principle.  The  Hamiltonian 


is  of  the  form 


H.  = tr  (U  ( j ) Pge  ( j ) + Pge  ^ j ^ ^ ee  ^ ^ ^ ^ex  ^ ^ ^ ^ ex  ^ ^ ^ 


(32) 


where  and  are  Lagrange  multiplier  matrices,  and  A^^  is  the  trans- 


ee 

pose  of  A^g 
ing  from  (25),  (26),  and  (27). 


ex 


The  costate  equations  can  be  found  from  (32),  after  substitut- 


(33) 


A.^(j)=  =G^(j)A  (j  + 1)  F(j)+A'^(j)  A (j  + l)  F(j) 

3P71jT“  . 

(34) 


The  terminal  conditions  for  (33)  and  (34)  are 


»ee  ("1=5 


(36) 


A (M  ) = 0 
xe  ' ' 


A necessary  condition  for  optimality  of  the  filter  gain  can  be  found  by  setting 
the  gradient  of  the  Hamiltonian  with  respect  to  K equal  to  zero.  The  result- 
ing expression  is 


Agg(j  + l){N(j  + l)  Pxx(j  + 1)  C^(3  + l)  + F(j)[Pg^(j)  - N(j)  P^x^j)] 
• A^  (j)  (j  + 1)  + N (j  + 1)  'F  J (j)}  + Aex  ^ '^xx  ^ (j  + 1) 

+ '^2^3)}  = Agg(j  + 1)  K(j)  {C(j  + 1)  P^^(j  + 1)  c‘^(j  + l)  + R(j  + 1) 
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Agg  (j  + 1)  F (j)  [ Pgg  (j)  - (j)  (j)  - N (j)  P^g  (j)  + N (.i)  P^^  (.i) 

•N^  (,i)  ] = [Ag^  {,i  + 1)  + Agp  (,i  + 1)  (N  (i  + 1)  - K (i)  C (i  + 1)  ) A (i) 

•[P^^(j)  N^(j)  - P^g(j)]  (37) 

We  have  presented  the  TPBVP  giving  the  necessary  equations  for  an  optimium 
choice  of  K and  F,  the  reduced  order  filter  design  matrices.  In  general  the 
TPBVP  will  be  rather  difficult  to  solve.  There  are  two  cases  we  shall  consider 
in  which  the  problem  simplifies  considerably  and  may  be  solved  with  a reason- 
able amount  of  effort.  In  the  first  case  the  TPBVP  simplifies  into  a single 
point  boundary  value  problem.  In  the  second  case,  the  problem  is  really  mod- 
ified so  that  F is  selected  prior  to  optimization  and  only  K is  optimally 
selected.  The  resulting  TPBVP  is  therefore  linear  and  can  be  solved  by  well 
known  procedures. 


SIMPLIFCATION 

In  the  first  case  we  assume  that  it  is  possible  to  choose  F (j)  in  such 
a way  that 

G(j)  = N(j  + 1)  A(j)  - K(j)  C(j  + 1)  A(j)  - F(j)  N(j)  = o (38) 

Then  from  (34)  and  (35)  we  can  see  that  A^g(j)  is  zero.  If  it  is  also  true 
that 

^^(j)  APgg(j)  - Pg^  (j)  (j)  (39) 


is  zero,  then  (37)  simplifies  to 
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Agg(j  + 1)  F(j)  N(j)  [P^^(j)  N^(j)  - P^e(j)]  = 

• [N  (j  + 1)  - K (j)  C (j  + 1)  ] A (j)  [ (j)  (j)  - P^g  (j)] 


(40) 


Thus  (38)  is  sufficient  to  insure  that  (37)  is  satisfied  under  the  indicated 
conditions.  We  must  investigate  the  requirement  that  f2(j)  is  zero,  and  see 
what  the  implications  are.  Using  (25)  and  (26),  a difference  equation  for 
n(j  + l)  may  be  derived,  i.e. 


i2(j  + l)  = F(j)  s^(j)  FT(j)  +[K(j)  C(j  + l){Q(j)  +H'(j)}  cT(j  + l) 

+ K(j)  {R(j + 1)  + fj(j)}  -N(j  + 1)  {Q(j)  +'i'(j)}CT(j  + l) 

- F(j)  Pex(j)  A’^(j)  cT(j  + i)  - N (j  + 1)  ^2  (j)  (41) 

+ K(j){¥2(j)  cT(j  + l)  + C(j  + 1)  4(j)}]K'^(j) 

We  will  show  that  the  bracketed  term  that  multiplies  K^(j)  in  (41)  is  zero 
when  K is  selected  optimally. 

If  G(j)  is  zero  and  A^g  (j  + 1)  is  nonsingular,  then  Equation  (36) 
becomes 


K(j)  [C  (j  + 1)  P^^  (j  + 1)  CT  (j  + 1)  + R (j  + 1)  + (j)  + ^2  (j)  (j  + 1) 

+ C(j  + 1)  >F^(j)]  = [N(j  + 1)  P^^(j + 1)+  F(j)  Pg^(j)  A^(j)]C^(j  + l) 

+ N(j  + 1)  S'5(j)  - [N(j  + 1)  - K(j)  C(j  + 1)]  A(j)  P^^(j)  PJ  ii) 

C‘^(j  + 1)  (42) 


where  we  have  substituted  for  F (j)  N(j)  from  (38)  in  the  last  equation.  If 
we  further  substitute  from  (27),  replacing  P.^^  (j  + 1)  , we  obtain 

AX 
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K(j)  [C(j  + 1)  {Q(j)  + 'Kj'))  C'^(j  + 1)  + R(j  + 1)  +^^(3)+ 

+ C(j  + 1)  = F(j)  Pg^(j)  aT  (j)  (j  + 1)  + N (j + ])[{¥  (j) 

. (43) 


+ Q(j)  )c''’(j  + l)  + >^2(3)  ] 

Equation  (43)  is  sufficient  to  show  that  (41)  may  be  written  as 

fi(j  + 1)  = F (j)  n (j)  F^  (j)  (44) 

whenever  K(j)  is  selected  optimally.  Since  n(o)  is  zero,  (44)  implies  that 
52(j)  = oV- j^o.  We  have  therefore  developed  alternative  equations  for  selecting 
K(j)  and  F(j),  i.e.  (38)  and  (43).  Furthermore  these  equations  do  not  involve 
the  Lagrange  multipliers  so  that  we  no  longer  have  a TPBVP.  If  we  define 


(45) 


(j)  aC  (j  + 

1) 

O' 

+ v(j)}  c 

^(j  + 1) 

+ C + 1 ) v 2 

(j) 

^2  (j)  4 ['^2  ■' 

■ c ( j + 

1)  { f(j) 

+ Q(j) 

(38)  and  (43) 

can 

be  written  as 

A'^(j)  C^(j  + 

1)1 

1 

1 

N^j) 

(j) 

1 

1 

1 

1 

-C  ( j + 

1)  A(j) 

p,e(J) 

K'^(j)  1 I A‘^(j)  NT(j  + l) 


F'  (j) 


or  equivalently 


kT  (j) 

L(j)  = v(j)  ( 

fT  (j) 

where  L(j)  and  v(j)  are  defined  as  the  corresponding  matrices  in  (48).  A 
necessary  and  sufficient  condition  for  (48)  to  have  a solution  is  that 


L (j)  L'+'(j)  v(j)  = v(j)  (49) 

where  L^(j)  is  the  pseudo  inverse  of  L"^(j).  If  a solution  exists  it  is  of  the 
form 

[K(j):  F(j)  ] = [L'+(j)  v(j)]  T + r*  [I  - L'^(j)  L (j)]^  (50) 

* 

where  r is  an  arbitrary  matrix. 

Since  it  is  clearly  possible  that  we  might  not  be  able  to  solve  (48),  and 
more  generally,  we  might  not  be  able  to  solve  the  TPBVP  specified  in  the  pre- 
vious section,  it  seems  appropriate  to  investigate  a suboptimal  approach.  We 
therefore  investigate  a partial  solution  where  F (j)  is  selected  a priori,  and 
only  K(j)  is  optimized. 

SOLUTIONS  WITH  SPECIFIC  F (j) 

Selecting  F(j)  a priori  has  an  important  consequence,  i.e.  the  TPBVP  to 
be  solved  in  optimizing  K(j)  is  linear.  There  are  a number  of  approaches 
available  for  solving  linear  TPBVP's,  so  that  one  can  solve  for  an  optimal 
filter  gain  with  a reasonable  amount  of  off  line  calculation.  Performance  will 
of  course  be  suboptimal  because  F(j)  is  not  optimized,  however  the  loss  in 
performance  can  be  kept  small  by  selecting  F(j)  in  some  way  which  relates  to 
performance  but  does  not  require  solving  a TPBVP.  A procedure  similar  to  that 
indicated  in  [8]  can  be  used  to  select  F(j)  where  only  single  stage  optimization 
was  considered. 

From  Equation  (33)  it  is  clear  that  with  F(j)  selected  a priori,  may 
be  regarded  as  a known  quantity,  just  as  P is  known.  With  A known  and  non- 

AX  cc 

singular,  one  may  solve  for  K(j)  using  Equation  (36).  The  resulting  expression 


is 


f 


where 

M*(j  + 1)=  [R(j  + 1)+  'f^(j)+  C(j  + 1)  '/(j) 

+ C (j  + 1)  P^^(j  + 1)  (j + 1)] (52) 


If  we  substitute  from  (51)  in  Equations  (26)  and  (24),  we  obtain  the  equations 
P^^(j  + 1)  = [I  - tP^x^J  + 1)  (j  + D + M*(j  + 1)  C(j  + 1)  ] 

•A(j)  P^g(j)  FT(j)  - cT(j  + l)  +'i'J(j)]  M*(j  + 1)  [ ^^(j) 

+ C(j  + 1)  P^^(j  + 1)  ] A^^(j  + 1)  ^■^(j  + l)  + Pj(j)  (53) 


A^e(j)  = AT(j)  [I  - C^(j  + 1)  M*  (j+ 1)  f + C(j  + 1)  P^^(j  + 1)}] 

•A^g(j  + 1)  F(j)  - A^(j)  cT(j  + l)  M*(j  + 1)  C(j  + 1)  A(j)  P^^(j)  FT(j) 
•Agg  (j  + 1)  F(j)  + P2  (j)  (54) 

where  Pj(j)  and  P2(j)  are  defined  as 

Pl(j)4  Pxx(j  + 1)  N^(j  + 1)  - A(j)  P^^(j)  N^(j)  F^(j)  -[P^^{j  + 1) 
•C^{j  + 1)  + Tj(j)]  M*(j  + 1)  [i  C(j  + 1)  Pxx(j  + 1)  + 

•N^(j  + 1)  - C(j  + 1)  A(j)  P^^(j)  N'^(j)  F^(j)]  (55) 
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and 

P„(j)  = [N(j  + 1)  A(j)  - F(j)  N(j)]^  A (j  + 1)  F(j)  - A^(j) 

2 ee 

• C'^(j  + 1)  M*  (j+1)  [ (C  (j  + 1)  (j+1)  + 4-2  (j)  1 N'^(j  + 1)  - C (j+1) 

•A(j)  P^^(j)  N^(j)  F^(j)]  Age  (j  + 1)  Hj)  (56) 

The  boundary  conditions  for  (53)  and  (54)  are  given  by  (31)  and  (35)  respec- 
tively. Since  (53)  and  (54)  are  linear  in  P and  A , there  are  many  ways 

X0  X0 

to  solve  the  TPBVP.  One  approach  is  to  assume  a 1 inear  relationship  between 
the  elements  of  P and  A . We  may  put  the  distinct  elements  of  A in  a 

Xc  Xc  Xvi 

* ★ 

long  vector  A^p,  and  similarly  form  P^„  from  the  distinct  elements  of  P^„.  If 
we  then  assume  the  relationship 

A*  (j)  = V*(j)  P*  (j)  + 6*(j)  (57) 

xe  xe 

then  recursive  equations  may  be  obtained  for  V (j)  and  6 (j)  and  the  problem 
is  transformed  into  a single  point  boundary  value  problem.  We  will  illustrate 
the  procedure  with  a specific  example  where  it  is  not  necessary  to  put  the 
distinct  elements  of  P and  A in  a vector  form. 

X0  Xc 

It  is  assumed  that  the  matrices  of  (23)  are  selected  to  be  of  a scalar 
form,  i.e. 


F(j)  = f(j)  I 
Age  (j)  = ^ee  (j)  1 


(58) 


where  f ana  X are  scalars;  the  terminal  matrix  s must  be  of  the  same  form, 
ee 

We  may  then  reposition  A^^  and  F in  (53)  and  (54)  and  obtain  equations  of  the 
form 


P^g(j  + 1)  = «^i(j)  Pxe(j)  ^ «12(j)  Axe(j  + 1)  + pi(j)  (59) 


t 
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r 

Ne(j>  ■ “21  * “22  <j'  '‘xalJ*"  * "2 

where 

aii(j)Af  (j)  [I  - lP^^(j  + l)  C^(j  + 1)  + 4'J(j)lM*(j  + l)  C{j  + 1)]  A(j) 
ai2(j)  A -A;l(j  + 1)  [P^^(j  + 1)  cT(j+l)  + tj(j)]  M*(j+1)  [f2(j) 

+ C (j  + 1)  P (j  + 1)  ] (61) 

XX 

(j)  A -f2  (j)  Agg  (j  + 1)  (j)  CT  (j  + 1)  M*  (j  + 1)  C (j  + 1)  A (j) 

a22(j)Al'(j)  A'^(j)  n - C^(j  + 1)  M*  (j  + 1)  { '('2  (j)  + C ( j + 1)  P^^  ( j + 1) }] 

A linear  relationship  between  P and  A is  assumed,  i.e. 

XG  XG 

Aj^g(j)  = V (j)  P^g(j)  + 6 (j)  (62) 

Substituting  (62)  in  (59)  and  (60)  gives 

Pxe(j  + 1)  = «il  (j)  ^ m + l)  + a^2  ( ^ ^ ^ ^ 

+ Pl(,1)  (63) 

V(j)  P^g(j)  +e(j)  = ^21  ^3^  ’'xe^3)  + «22^3)  V (j  + 1)  P^e  + D 

+ 0.22  (j)  B(j  + 1)  + P2  (J)  (6-1) 

Solving  for  Pxe(j‘^l)  (63),  and  substituting  that  result  in  (64)  gives 

V (j)  P^g  (j)  + a(j)  = «21  (j)  '’xe  ^3)  + «22  ^3^  ^^3+  D + ^2  (j) 

+ ^22(1)  V (j+  1)  [I  - a^2(3)  (j^  3^^  l^“ll(3)  Pxe^3)  + ai2^3) 


'0(j  + 1)  + (j)  ] 


(65) 
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If  the  above  is  to  hold  for  arbitrary  P , then  we  must  have 

xe 

V (j)  = “22  (j)  V (j  + 1)  [I  - a^2  (j)  V (j  + 1)]  ^ (j)  (66) 

and 

3(j)  = 0(22  (j)  [I  + V(j  + 1)  (I  - aj2(j)  6(j+l) 

+ P2  (j)  + «22  (j)  V (j  + 1)  (I  - (j)  V (j  + 1))'^^  (j)  (67) 

The  terminal  conditions  for  the  above  are 

V (N*)  = 0;  6 (N*)  = 0 (68) 

We  may  thus  compute  V and  b backward  in  time  so  that  is  known  in  terms  of 
P/e'  tie  calculated  forward  in  time  using  Equation  (51)  and 

solving  for  P from  (59). 

Act 

In  this  section  we  consider  a simplified  model  of  a discretized  inertial 
system  as  considered  in  [8].  The  dynamics  are  modeled  as 


X (j  + 1) 


X (j)  + w (j) 


(69) 


where  A is  ,2  and  the  observation  model  is 

y (j  + 1)  = [ 1 + e(j  + 1) , 0 ] X (j  + 1)  + V (j  + 1)  (70) 

where  e(j)  represents  a sequence  of  independent  random  variables  such  that 

Pr  [ E(j)  =1  ] = 

(71) 

Pr  [ e(j)  = -1  ] = 

and  >2  then  is  the  probabilty  that  there  will  be  no  signal  measured  at  a given 
stage.  If  we  define  n(j)  as  e(j+l)  then  (70)  may  be  written  as 


r ■ 

n 
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y ( j + 1)  ==  [1.  o]  x(j  + l)  +v(j+l)  + [x^  (j)  +Ax2(j)]  n(j)  (72) 

where  it  has  been  assumed  that  w^lj)  is  zero.  Equation  (72)  is  of  the  same 
form  as  (3).  The  initial  conditions  for  the  example  have  mean  value  zero  and 
the  initial  variance  matrix  is  the  identity  matrix.  The  variance  of  the 

r 

measurement  noise  is  R = .1,  and  the  variance  of  the  plant  noise  parameter, 
W2(j)  is  1.  It  is  desired  to  estimate  x^  using  a first  order  filter  of  the 
form 

z (j  + 1)  = F (j)  z (j)  + K (j)  y (j  + 1)  (73) 

' There  is  no  driving  term,  g(j),  since  y(j)  is  zero.  The  parameter  F(j)  is 

I not  selected  optimally,  but  selected  using  a one  stage  optimization  procedure 

i as  indicated  in  [8].  The  parameter  K(j)  is  then  selected  by  solving  a linear 

■ TPBVP  as  suggested  in  this  paper.  Optimization  is  with  respect  to  the  perfor- 

! mance  measure 

■ N*-l 

• J = E{  X (j)  + e^  (N*)}  (74) 

[ j = o 

In  Figure  1,  the  performance  of  the  reduced  order  filter,  designed  to  account 
for  the  probabil ity  that  there  will  be  no  signal  present  at  a given  stage,  is 
compared  with  that  of  a second  order  Kalman  filter  designed  using  the  assump- 
tion that  the  parameter  multiplying  x^  (j  + 1)  in  (70)  is  at  its  mean  value  of 
un  i ty . 

j 

SUMMARY  AND  CONCLUSIONS 

In  this  paper  we  have  investigated  the  discrete  reduced  or^  r *-  ■ 
problem  for  systems  with  state  dependent  noise.  The  aenpr,.’ 
to  be  a nonlinear  matrix  TPBVP,  however  the  result  be(  ■ 
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complicated  under  certain  circumstances,  resulting  in  an  initial  value  problem. 
Simplification  also  results  when  some  of  the  filter  parameters  are  selected 
optimally  and  the  others  are  chosen  a priori.  In  this  case  a linear  TPBVP 
results,  which  can  be  solved  by  standard  procedures.  Because  systems  with 
state  dependent  noise  occur  frequently,  we  feel  that  it  is  important  to  con- 
sider the  reduced  order  filtering  problem  for  such  systems.  As  indicated  by 
the  example  problem,  such  models  can  represent  intermittent  observation  data 
as  well  as  a number  of  other  situations.  The  reduced  order  filters  described 
in  this  paper  are  suited  to  those  situations  where  it  is  important  to  reduce 
the  number  of  on  line  calculations,  and  the  gains  may  be  precomputed  and 
stored  for  use  in  the  filtering  process. 


r 

i 

f 


I 

i 
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Abstract 

A feedback  control  is  obtained  for  a linear  random  tracking  problem  with  a 
quadratic  performance  measure.  Both  inputs  to  the  system  and  the  reference  vector, 
which  is  to  be  tracked,  are  capable  of  instantaneous  change. 


INTRODUCTION 

TracKing  problems  are  an  important  category  of 
control  problems  where  the  output  of  a systeri  is 
required  to  follow  a reference  vector.  Such  prob- 
lems have  been  extensively  treated  in  the  litera- 
ture [1].  [2].  In  this  paper,  we  consider  this 
class  of  problems  within  a format  which  allows  the 
reference  vector  to  change  values  suddenly.  Thus 
the  system  may  be  tracking  one  vector,  and  sudden- 
ly it  is  required  to  track  another  vector.  For 
added  generality,  we  also  allow  for  the  possibil- 
ity of  system  inputs  which  can  change  instantly. 

It  is  assumed  that  the  input  and  reference  vectors 
may  change  instantly  and  randomly  in  time,  so  that 
the  problem  is  a stochastic  control  problem.  The 
elements  oT,  these  vectors  are  modeled  as  Markov 
jump  processes  with  a finite  number  of  states  [3]. 
Sworder  introduced  a method  for  solving  related 
stochastic  control  problems  in  [4],  where  the 
parameters  of  a linear  system  were  modeled  as 

•This  research  was  supported  in  part  by  the  Air 
Force  Office  of  Scientific  Research,  under  Grant 
No.  AFOSR-77-3248. 


Markov  jump  processes.  We  apply  similar  methodology 
to  solve  the  random  tracking  problem  in  this  paper. 

PROBLEM  STATEMENT 

Consider  a system  described  by  the  linear  differ- 
ential equation 

x(t)  = Ax(t)  + Bu(t)  + >(t)  (1) 

where  Y(t)  is  a random  input  vector.  The  output 
equation  is  also  linear 

y(t)  = Cx(0  (2) 

The  control  vector,  u{t),  is  to  be  selected  to 
minimize  a quadratic  performance  measure 

J = I E( I" J[y(t)  - g(i)]^Q[y(T)  - n(i)] 

+ u^(t)  Ru(T)jda  I x(t) , r(t)l  (3) 

In  (3),  0(1)  is  the  random  vector  to  bo  tracked, 
and  r(t)  is  an  indicator  function  which  indicates 
the  values  of  ri(t)  and  Y(t).  The  stochastic 
vectors,  y{t)  and  ri(t),are  Markov  jump  processes 
described  using  the  transition  probabilities 

Pr[n(t+A),  y{VA)  - . y^  I h(t),  Y(t)  Pj  , Yj] 

=■  OjjA  + 0(A)  , i j 
= 1 + qj.A  + 0(A)  , i = j (4) 

for  i ;j  = 1 , 2,  . . . , s 
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Thus  m-  ire  drsiqninq  a control  to  track  a signal 
which  IS  capablo  of  instantaneous  change,  and  are 
also  Subject  to  instantaneous  input  disturbances. 
The  controller  at  all  tirios  has  perfect  informa- 
tion, r(t),  which  indicates  the  current  values  of 
the  signal  to  be  tracked  and  the  input  disturban- 
ces. Using  notation  corresponding  to  that  of 
Sworder  [9],  we  denote  the  event 

ln(t).  Y(t)  * Hj,  t[i]  (5) 

Note  that  there  are  only  a finite  number  of  pos- 
sible values  which  can  be  assumed  by  n and  y.  From 
the  definition  (4)  it  is  clear  that 

E ‘’ji  = ° (6) 

i = l 

and  we  will  make  use  of  this  fact  in  the  develop- 
ment of  the  optimal  controller. 

THE  OPTIMAL  SOLUTION 

To  obtain  the  solution  we  follow  steps  similar  to 
those  used  in  [4].  The  Hamiltonion  for  this 
problem  is 

H(x,u)  = j (Cx-n/Q(Cx-n)  + ^aj^Ru 

+X^[Ax  + Bu  + y]  (7) 

whore  X is  a stochastic  Lagrange  Multiplier.  It 
is  necessary  to  minimize  E{H[x(t) , u (t)] |x(t) , 
r(t)}  with  respect  to  the  contorl,  u.  This  leads 
to  the  equation 

u[x(t).  r{t)]  = -R'’B^E{X(t) |x(t) , r(t)}  (8) 

It  is  convenient  to  assume  a linear  form  for  X 
X{t)  = P(t)  x(t)  + a(t)  (9) 

so  that  u(.)  can  be  written  as 
u[x{t).  r(t)]  = -R’’B^[E{P(t)|r(t)}  x(t) 

+E{a{t)|r{t))]A  K(t)  x(t)  + B(t)  (10) 
Substituting  for  u in  (7)  and  differentiating  with 


or,  substituting  from  (1), 

X = (f  + PA  + PBK)x  + PBB  + Py  + a (14) 

If  (11)  and  (14)  are  equated,  and  we  substitute 
for  X from  (9),  then  two  equations  are  obtained: 

P + PA  + PBK  = -C^gC  - K^RK  - (A+BK)^P  (15) 

cVi  - K^RS  - (A+BK)^a  = PBB  + Py  + u (16) 

These  equations  are  required  if  (11)  and  (14)  are 
to  be  equivalent  for  arbitrary  x. 

From  (15)  we  calculate  E{P(t)|r(t)E[j]}.  The  re- 
sulting expression  is 

E{P(t)|r(t)c[j]}  = -Pj(t)A  - A'^Pj(t)  - cV 

+Pj(t)  BR'^B^Pj(t)  (17) 

where  we  have  defined 

Pj(t)  A E{P(t)|r(t)c[j])  (18) 

and  used  the  fact  that 

E{K(t)|r(t)e[j]}  = -R-’B^Pj(t)  (19) 

The  left  hand  side  of  (17)  can  be  calculated  a 
different  way  as 


E(P(t)|r(t) 

c[j]) 

- 

1 im 

E lPlti/'MtJ_cLi2)_-  E ( P(t)|r (t)E[j]} 

A*0 

A 

(20) 

Using 

(4). 

we  then  obtain 

E{P(t) 

|r(t) 

c[j])  = E 

i = l 

P^(t)  + Pj(t) 

(21) 

Hence 

combi 

ning  (17)  and  (21),  there  results 

"j  * t ’it  ■ -'j*  • “'"j  ■ 

(22) 

Noting  that  (22)  gives  the  same  expression  for  all 

P.,  and  using  (6),  we  have 
.•1  • _ 

P = Pj  = -PA  - A^P  - cV  + PBR'^B  P (23) 

j=l,2,...,s 


I 


respect  to. x,  we  obtain  the  differential  equation 
for  X, 

X = -(JqC  y K^RK)x  + C^On  - K^RB  - (A+BK)^X 

(11) 

which  has  terminal  condition, 

X(T)  = 0 (12) 

Another  expression  for  X is  found  using  (9),  i.e. 

i(t)  ■=  P(t)  x(t)  + P(t)  i(t)  + a(t)  (13) 


Equation  (23)  is  fortunate,  since  it  indicates 
that  it  is  only  be  necessary  to  solve  one  Riccati 
equation  and  not  s coupled  Riccati  equations. 

Using  equation  (16),  we  calculateE{n(t)|r(t)c[j]l, 
ohta  ininn 

E(u(t)  |r(t)E[j])  = C^Q'lj  - A^aj(t) -P  (E)Tj 
+ P (t)BR‘’  B^  aj(t)  (24) 
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where  we  have  defined  the  vector 


aj(t)  A E{a(t)|r(t)(,[J]) 
and  noted  that 


(25) 


E{B(t)|r{t)L[j])  = 


(26) 


Proceeding  as  in  obtaining  (21),  we  find  another 


expression. 


E{a(t)|r(t)e[j]}  = Z Q,:  o.(t)  + a.(t)  (27) 

i=l  J . 

Combining  (24)  and  (27),  we  then  find 

s 

= eVt,  - A^a.  -P  y.  +r  - Z Ri,  “i 

J J J J J Ji  I 


j = 1.2 s (28) 

Hence  there  are  s vector  equations  to  bo  solved 


for  the  terms  (“j).  and  these  are  coupled  through 
the  parameters  From  (12)  it  is  clear  that 

the  terminal  conditions  for  (2  3)  and  (28)  are 


P (T)  = 0 


“j(T)  = 0 


(29) 


The  optimal  control  strategy,  is  thus  found  to  be 


u[x(t).  r(t)]  = -R'^B^[P  (t)x(t)  + aj(t)]  (30) 


when  r(t)E[j] 


where  P (t)and  aj(t)are  found  by  solving  (23)  and 
(28)  respectively. 


EXAMPLE 


We  will  consider  a simple  scalar  example  with  an 
input  which  can  change  suddenly.  The  dynamical 
description  is 

X = u + y(t)  (31) 

where  y(t)  has  two  probable  states,  Y(t)  = y*  or 
Y(t)  = 0.  With  y*  a transient  state  and  0 an 
absorbing  state,  we  have 


Pr[y(t+A)  = Y*|y(t)  = y*]  = 1 - qA 

Pr[y('t+A)  = y*|y(t)  = 0 ] = 0 

Pr[Y(t+A)  = 0 |y(t)  = y*]  = qA 

Pr[y(t+A)  = 0 |y(t)  = 0 ] = 1 

where  q > 0.  The  problem  is  an  infinite  horizon 


(32) 


correspondino  to  (22  ) we  have  the  denenerate 
Riccati  solutions, 

P = P, 


Pj  = 1 


= h + 4(q+l)  -q 


(33) 


where  we  have  assumed  that  if  r(t)t[l]],  it 
means  Y(t)  = 0,  and  r(t)r[2]  means  Y(t)  = y*.  We 
have  corresponding  equations  for  and 


= 0 


“1  “1 

--  r*  +(  1 ♦ q)<i2  = 0 


(34) 

(35) 


which  gives 


. Y* 

“2  ■ 1 + q 


(36) 


(37) 


The  optimal  control  is  thus 
u[x(t),  r(t)]  =-  x(t)  - Y»/(l+q)when  r(t)r[2] 

u[x(t),  r(t)]  = -x(t)  when  r(t)Efl] 

Suppose  that  to  begin  with,  r(t)E[2],  then  sub- 
stitution of  (37)  in  (31)  gives 

(38) 

which  describes  the  trajectory  until  y(t)  switches 
to  zero,  at  which  time  we  begin  to  have  a decaying 
exponential  described  by 

X = -X  (39) 

The  results  appear  to  bo  reasonable,  although  they 
are  not  intuitive. 


SUMMARY 

In  this  paper  we  have  applied  the  stochastic 
minimum  principle  to  solve  a random  tracking  pro- 
blem where  the  values  to  be  tracked  were  capable 
of  changing  instantly.  In  addition,  input  dis- 
turbances have  been  modeled  as  Markov  jump 
processes  which  could  suddenly  change  values.  A 
feedback  control  was  found  which  minimized  a 
quadratic  performance  measure.  The  control  law 
was  found  to  be  linear  in  the  state  vector  and  con- 
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